WOAD: Web-Organized Application Development

A Development Environment to Dye For

Note to the Reader:

This document is a general description of the WOAD system. However, it is written not as a tutorial but as a working design sketch. Hence, the exposition follows the design process and includes the designer's musings and the rationale for major design decisions.

Contents


    1: General Description
    2: Operating Modes
    3: Requirements
    4: Features
        4.1: Code Viewing and Understanding
        4.2: Cross-application editing and linking
        4.3: Collaborative Annotation
    5: Use Cases
        5.1: Single-User Development
        5.2: Developer Community: Decentralized
        5.3: Developer Community: Centralized
        5.4: Customization
        5.5: Platform
    6: WOAD Tree Structure
        6.1: Keywords
        6.2: URL Annotation
        6.3: Source Annotation
    7: Implementation Plan
    8: Implementation Notes
        8.1: Filenames, Extensions, Directories
        8.2: Where It Lives
        8.3: What does a WOAD site look like?
        8.4: Project Structure
        8.5: Application Structure
        8.6: File Viewing and Editing
        8.7: Demo Trees
    9: Alternatives
        9.1: Alternative Derivations of WOAD
        9.2: Alternative Acronyms
        9.3: See Also

1: General Description

WOAD stands for Web-Organized Application Development (see here for alternative meanings) -- a web-based application development tehnique, philosophy, and toolset (WOAD Warrior) based on browsing, annotating, and editing the application's source tree using a web browser and specialized codebase server. Putting it another way, the idea is to make an application (or collection of applications) into a website around which the developer community is organized.

Note that this technique is orthogonal, but complementary, to the idea of a ``Web Application'' -- WOAD can be used to organize any application, though it is highly optimized for web applications (e.g., it's easy to go from a web page's source to a browser preview). It is also possible to link a web application with its WOAD server in order to make a tight coupling between a user community and a developer community.

WOAD can be thought of as a kind of ``IDE for the Web'' as well as being a browser-based IDE. The general idea is to build a web-based tree browser, code viewer (with syntax-based color highlighting, as in Emacs font-lock-mode), and documentation extractor (e.g. JavaDoc), with full cross-linking between keywords and documentation and between application web page previews and the corresponding source code. In addition, any page in the application, and any file in the source tree, can be collectively annotated, just like the manual at php.net/. It's sort of like Code Warrior meets SourceForge in a Wiki-Wiki Web.

It's actually slightly more accurate to regard WOAD as a Dis-integrated Development Environment -- it works quite happily alongside whatever editor, version-control system, build manager, web server, upload program, and so on you are presently using in your development.

2: Operating Modes

WOAD can be used in several different ways, depending on whether it is a personal tool or a shared environment, and whether it is separate from or integrated with the application being developed:

3: Requirements

WOAD requires:

WOAD works well with but does not require:

The applications and development tools above are the ones that WOAD is designed to work with ``out of the box'' -- it should be easy to add interfaces to others.

4: Features

4.1: Code Viewing and Understanding

The ``understanding'' part comes from the ability of the code viewer to link every identifier or keyword with its corresponding definition (if it has one) and documentation (if it has any). For example, in the statement:

   String foo = new Table(bar.getSomething())

The idea would be to link String, a system class, directly to its documentation at Sun, Table, a PIA class, to its local source file (which in turn would link to its javadoc documentation), and bar and getSomething to their declarations.

4.2: Cross-application editing and linking

Every file would be linked to, typically, two editors: one web-based and implemented directly in the WOAD server, and one user-specified application (e.g. Emacs). Suitably-equipped editors would have links back to the browser.

Similarly, every file would be viewable both in source form through WOAD, and in final form through a local copy of the target server. The latter would be visible through multiple browsers (e.g. Netscape and Lynx).

In the various WOAD views, there will of course be controls for building derived files, handling version control, updating the website, and so on. These will have to be configurable for different build programs (make and ant), different version control systems (CVS and RCS)), different update techniques (ftp, rsync, scp, ...) and so on.

4.3: Collaborative Annotation

It will be easy to attach annotation to the code, and in some cases to see it (or at least find it) from the resulting web pages. Linking between annotation and code, and vice versa (from the code view to the annotation) will be as simple as possible.

Some applications, of course, will not want to make annotation and developer-feedback links available on the final website, but some (e.g. php.net/) will, and WOAD will make this functionality available too, e.g. by means of PIAServlet.

Two general styles of annotation are required: threaded discussions, which are best for gathering input from a large user community, and collaborative ``anything goes'' web editing, which works best in a small developer community. Text formatting and linking conventions will be the same for both.


5: Use Cases

5.1: Single-User Development

This is the simplest case. I start with a copy of WOAD Warrior on a local machine, and use it either by itself or, more likely, alongside my regular editor to develop a web application.

Another use for WOAD Warrior will be as a tree browser and customizer for imported applications. In other words, I download a source tree and set up a parallel WOAD tree for annotation.

In this mode, WOAD is just a personal tool along the lines of an IDE.

Once the web application is deployed (for example, on an Apache server with PHP), I have the option of deploying all or, more likely, part of my WOAD tree for PHP-style documentation that my users can add to. (See below.)

5.2: Developer Community: Decentralized

This is almost as simple. A development team sets up a CVS repository for a new application. Each team member then uses a personal copy of the WOAD server as a tree browser and editor, occasionally checking their changes back in.

5.3: Developer Community: Centralized

In this case the source tree and the WOAD annotation tree are separate. Ordinary programming is decentralized using CVS, as usual, but everyone shares the same WOAD tree so that annotations can immediately be shared with all other developers.

Even in the centralized case, an individual developer may well run a local copy of WOAD Warrior as a code browser over their personal code tree, and to keep personal notes and personal customizations. The local copy will of course proxy to the shared one.

5.4: Customization

Once an application is deployed, there are suddenly two communities to worry about: the developer community and the user/customizer community. In this case there may be two distinct WOAD trees overlaid on the same source tree: one for the developers and one for the users. It's also likely that the users' view will be restricted; e.g. to only the web pages.

Because WOAD makes it easy to develop specialized, web-based editors for configuration files and web pages, it's a good way to deploy specialized customization tools for an application.

Customization is of course going to be done locally at each installation of the application, so there will be many customization communities springing up, each around its own WOAD server. Each group of users will have to decide on a policy about which annotations and customizations to share with other groups; the simple default is just to keep them local.

Even with a local WOAD server, requests that can't be fulfilled locally will be proxied up to the centralized server so that changes in the shared content will be immediately available. Note that there will need to be ways of locking both WOAD pages and application files so that users trying to customize things don't step on the originals.

5.5: Platform

There will be some kinds of applications that can be implemented directly over a WOAD server: mainly applications that include discussion forums, collaborative editing and annotation, and collaborative development of structured web pages. For example, dmoz, Wiki, and slashdot are examples.

A ``real-world'' WOAD-based application might be part of a larger website, perhaps an intranet.


6: WOAD Tree Structure

There are actually four sets of URL's involved in a Woad-organized application:

It is vitally important that the target and annotation URL trees be isomorphic. This extends as far down as query strings: query strings in the target need to be available on the annotation pages, because annotations may apply to specific queries.

Note that POST requests present a significant problem here -- there is basically no way to recapture them in most cases. They can be recaptured if the target server is Woad-aware and has links to Woad on all of its pages, or if the Woad server is being used as a proxy.

Corresponding to the URL trees above, we have the following parameters that define their relationships. Note that all of the following parameters are expected to start with a ``/'' character and not to end with one.

Note: parameter naming conventions
Prefix - prepended to a target path to make a WOAD path, or the location of a WOAD tree component relative to the WOAD root.
Suffix - appended to a WOAD path to reach a related component.
Offset - location of one part of the WOAD tree relative to another.

6.1: Keywords

Keywords are a bit of a problem. One option is to make them all queries, another is to put them under /.Woad/Words; we don't do either of these, choosing instead to add another prefix, so that the keywords can all be found under /.words.

This is not strictly true at the implementation level: the directory /.Woad/Words does exist, and is simply aliased as /.words -- this makes it possible to include some pre-built headings and indices with the distribution.

Keyword pages can get fairly complex, because a word may be used in several contexts, and these will want separate annotation. This suggests that each word should correspond to a directory. This wins, because it makes it possible to associated arbitrary amounts of information with a word, and there's minimal interference between the different items.

The directories immediately underneath /.words correspond to contexts in which words can be defined. For example, /.words/func contains words declared as functions; /.words/file contains words used as the names of files.

It will probably prove necessary to hash words into subdirectories on their first character (or maybe even their first two characters).

6.2: URL Annotation

Similarly, it sounds as though the best thing to do for annotation may well be to make a directory for every page being annotated. This makes it easy to separate annotation files from ``real'' target pages in a listing: the real pages and real subdirectories map into directories; the annotation maps into files. It also makes it trivial to obtain a specific piece of annotation: simply tack its name onto the URL.

We should handle missing documents by passing the requested URL and pathinfo in a query to the listing file in the last valid directory.

6.3: Source Annotation

Source annotation is a different problem. Making every source file correspond to an annotation directory would be severe overkill, and would make it impossible to distribute annotation together with sources. Hence, we will put source annotation in subdirectories (with the name sourceSuffix) of the source tree.

Note that things are still fairly simple: the source tree is a virtual subtree of the WOAD root, so the annotation will simply hang off the corresponding real subtree. Since the source tree retains its structure, it will be trivial to map filenames into URL's, and relative links in source files will still (usually) work.

It will be possible to use WOAD as a ``pure'' source browser or directory browser: this simply corresponds to the situation where sourcePrefix is null, and there is no target server and no corresponding target annotation tree. Putting the target URL annotations further down in the tree, e.g. hanging off the source document root, is another possibility (for the future), as is putting source and URL annotations on separate servers.

The above implies that the current work is by no means wasted: the situation in the source tree is essentially the same as it is now, only at an offset from the main (target URL annotation) tree.


7: Implementation Plan

  1. Sketch out the basic tree-browsing tagset and configuration files.
  2. HTML/XML viewing, with links to the tagset documentation, included or otherwise referenced files, and to the parallel web-server used for previews. At this point you would have something you can point at a directory tree and browse around with.
  3. Code viewing, which will require parsers and indexers for non-SGML-based languages but otherwise is just like HTML viewing. It may be possible to leverage Emacs ETAGS files here.
  4. Threaded discussions and collaborative web-page creation. (These are closely related because they will use the same text-to-HTML parser, with editing and linking conventions similar to those used in SWiki. Linking will eventually have to evolve to include code and other non-local linkages, which are tricky. Also, it should be possible to make certain linkages without requiring the user to explicitly delimit them (e.g., class names should be recognized anywhere).
  5. CVS history browsing. Possibly using one of the existing CGI's.
  6. Linkage to programs: make, emacs, ...
  7. Web-based editing for text. (Essentially trivial, since you just throw the whole file into a text box.)
  8. Web-based (wizard-like) editing for specialized files such as Makefile's and XML configuration files, and for page fragments such as HTML tables and lists of links. At this point you would have something you could use for maintaining a small web site.
  9. Web-based structured editing for code and XML files, using the same parsers as code viewing. See the old InterLisp editor.
  10. Simplified parser development using XML description files for both syntax and viewing/editing semantics. At this point you have something that can be self-extended to handle any kind of programming language or web-page structure.
  11. Linkages between WOAD-based applications. WOAD restricted to subtrees. This is what you need in order to share information between a developer community and multiple user/customizer groups.

Note that this is only one of many possible orderings: most of the ``steps'' above are independent.


8: Implementation Notes

8.1: Filenames, Extensions, Directories

In most cases it will be necessary to keep WOAD configuration files and so on carefully separate from the application's own files. In particular, we have to stay away from things like HEADER.html, _subsite.xcf, *.xh, Makefile, and so on. This suggests:

There are actually several good ways of doing the annotation; we'll probably want to be able to handle all of them:

8.2: Where It Lives

Possibly the most difficult decision will be where to put WOAD's own source tree -- or even whether it should have one!

After consultation with Greg it looks like the best choice is PIA/Apps/Woad, so here we are.

8.3: What does a WOAD site look like?

The flip side of ``where it lives'' is how you navigate around a WOAD site. My guess is that all of the alternatives below will be wanted, in different situations.

8.4: Project Structure

There are three different cases for a project:

The conclusion is that, in all cases, a project page is just the index page of the project's WOAD tree. This simplifies things.

8.5: Application Structure

So, now that we know what a WOAD site is supposed to look like, how do we structure PIA/Apps/Woad and the PIA root directory (~/.pia) to make it look like that?

There's a bit of a problem, in that Woad can run in several different configurations:

There are also four plausible configurations; they ought to be selectable on a per-project basis:

Getting the real root to point to the source tree may be tricky. It requires either rewriting the _subsite.xcf in Warrior's real root directory (and rereading it, which isn't supported yet), or adding some kind of mount(path) method to Subsite. Getting the virtual root to point to the source should be easier: just create the new real directory and write its _subsite.xcf file. (But does this really work? It's possible that the _subsite.xcf file gets read as soon as the directory is created, at which point we're in trouble.) It's probably simplest just to fix Subsite so that it can reread the config file -- we need to do that anyway.

During initial implementation we can finesse this problem by hand-building the necessary project directories. And we can build it into the top-level woad script for starting the server. It's not a problem for the ThisPIA project, either, because we can just put the necessary stuff into PIA/.

8.6: File Viewing and Editing

The big problem here is that we want to be able to view files in multiple formats. This means that the most obvious thing, just redefining all the tagsets, doesn't work. What we really would like is an automatic wrapper document that does an <include> of the file with the appropriate tagset, based on a query string.

It might be possible to dance around this problem, too, in one of the following almost equally disgusting ways.

All of these schemes get into a certain amount of trouble with images -- sometimes you want them wrapped, but sometimes you want to just see the darned thing. So this requires explicit rewriting in the wrapper.

All schemes have to worry about paths that start with ``/'' -- in other words, ``absolute'' paths that need to be prepended with the server root.

Schemes that modify the path to splice in the viewer, rather than changing the tagset mapping, make it much easier to get previews and view images -- just don't modify the path. The ``trailing /-'' scheme additionally allows the extension map to specify the real tagset for the page, which is a good thing, but it also means that the tree is ``live,'' which may be a very bad thing.

My guess is that the right thing to do is the following:

  1. Start by mapping extensions into tagsets that give a default source view. Use the definition for <html> to fake start and end wrappers.
  2. Add tagset/parser logic to allow a real wrapper to be defined. Probably needs lazy &content; evaluation at the same time to make it work efficiently.
  3. Figure out how to remap /- to a wrapper for directory listing.
  4. Use explicit viewer pages with URL's in query strings to provide additional views and editors. They'll be linked from the wrapper in the main view, so tagset information can be passed at that point.
  5. Implement either the /- mechanism or the pathinfo mechanism (viewer.xh/path to simplify views.

8.7: Demo Trees

The presence of the PIA project wants to be conditional on whether PIA_HOME is the virtual root. Some explicit checking might be in order to see whether one should make an Apache project.


9: Alternatives

9.1: Alternative Derivations of WOAD

9.2: Alternative Acronyms

9.3: See Also


Copyright © 2000 Ricoh Innovations, Inc.
$Id: about.html,v 1.12 2001-01-11 23:36:36 steve Exp $