DRAFT: This is a work in progress! Read at your own risk; believe at your peril!
Note that this file is not actually checked in yet; I'm still thinking about the best place to put it within the PIA documentation tree.
This document discusses the PIA's architecture, the interfaces (API's) that it supports, and ways of integrating the PIA with other web servers and into other web applications in such a way to preserve cross-platform application portability.
The PIA architecture has three main components (also referred to below as ``features''):
dps
-- the Document Processing System
that implements the tag language. By itself, the DPS essentially
provides a simple but complete ``macro language'' for XML and HTML web
pages.
site
-- the site structure package. This
provides a simple, robust, versatile, and OS-independent way of
structuring the files and directories that comprise a web application.
agents
-- PIA agents provide the ability
to operate (using the DPS) on web requests and documents as they
pass through a PIA-based proxy server. It is worth noting that most
PIA applications will not require agents to operate on proxied
transactions.
There are also four different degrees to which the PIA or its components may be integrated with a web server such as Apache:
The following table lists some interfaces which it is plausible that the PIA might support:
Interface | Difficulty | Features | Priority | Notes |
---|---|---|---|---|
Protocol Interfaces | ||||
AJP | mid | +site +agents | mid | This is the protocol used between Apache and the Java servlet engines; most other high-end web servers also support it. It may also be used by the PHP engine. Code can be obtained from Tomcat or JServ. |
Java Interfaces | For interoperation with existing XML toolsets. | |||
Servlet | low | ?site ~agents | high | Mark did this; it should be easy to resurrect. The Servlet interface supports non-proxied agents. |
DOM | low | -site | high | Already written, but tuning and testing are required. |
SAX | mid | -site | high | The hooks are in for this, but they're totally untested. |
TRAX | ? | -site | low | This is a proposed interface; we'll have to wait until it stabilizes. No information available yet. |
Cocoon 1.x | low | -site | mid | This is essentially a one-liner given DOM. |
Cocoon 2.x | mid | -site | low | Probably simple given SAX and a stable Cocoon 2.0, but it may take a while to get to that point. |
Tomcat Servlet |
mid | +site +agents | low | Tomcat includes hooks for agents. |
C Interfaces | Difficulty ratings assume existance of CPIA. | |||
mod_dps
|
low | -site | high | Should be very simple given C port of DPS. |
mod_site
|
high | -dps +site | high | Does not require DPS; can be done using the XML
parser built into Apache 2.0. It should be possible to leverage
off of the metadata kept by mod_dav .
|
mod_pia
|
mid | +site +agents | mid | This is really just adds agents to
mod_dps and mod_site
|
The features +dps and -agents are assumed
unless otherwise noted. The ``full-featured'' PIA is
+dps +site +agents .
|
|
|
There are two main paths to a ``full-featured'' PIA integrated with Apache:
mod_pia
-- a full-featured PIA
implemented as a set of standard, C, Apache modules. It should
not be necessary to implement a stand-alone PIA server in C.
mod_proxy
-based scheme. Some
investigation is required, mainly to see whether proxying (and hence
agents) can be supported through AJP.
Several intermediate or alternative structures are possible:
mod_rewrite
it might even be possible to
support those as well.
mod_site
+ AJP -- this puts processing
into a separate server, but allows Apache to serve static HTML pages
directly.
mod_perl
-- a PERL implementation of the
PIA would allow a full-featured PIA to run as part of Apache and
interoperate cleanly with existing mod_perl
-based web
applications.
It is likely that the DPS will be useful outside of the PIA. Since the existing code will soon support all of the major Java interfaces (DOM, SAX, Servlet) used in XML applications this use should be encouraged, and may eventually lead to the PIA's tag language becoming a standard, either as an alternative or (less likely) an extension to XSLT.
No effort should be put into alternative implementations (except for C, of course) at this time, but if some outside developer wants to do one they should be encouraged.
mod_perl
is very popular and because most of the machinery
required is already present in the form of PERL modules.
What follows is the implementation roadmap for the PIA's interfaces:
dps
and site
packages
with any servlet-based web server.
org.risource.pia
and
org.risource.content
.
mod_pia
; the C port should be ready at about
this time.
mod_pia
and the Java
interfaces, it should be possible to do a major cleanup and
simplification of the PIA's Java implementation. This would include
eliminating redundant classes and interfaces, lazy expansion of tag
content, and major overhaul of the org.risource.pia
package.