XMLTP/L, XMLTP Light

March 25th, 2003 by Jean-Francois Touchette in

Describing how XMLTP Light works and the environments for which it is suited.
Your rating: None

XMLTP/L, or XMLTP Light, is a lightweight RPC protocol that uses XML to encode the stream of data. XMLTP/L has been designed to do fast RPC calls over an intranet, within an enterprise. More specifically, the first purpose of XMLTP/L is to forward transactions (RPCs) to a database server. But, it also can be used to do method calls to any server that follows the common RPC technique introduced by XML-RPC and older client/server protocols.

The origins of XMLTP/L come from the inherent limitations associated with proprietary client/server protocols: basically, they are proprietary. XMLTP/L is distributed as source code and is licensed under the GNU Library General Public License (GNU LGPL).

Because of the languages used in the current implementation of XMLTP/L, developers who use C, Python and Java should find this article more interesting. But, we expect ports will be made to other scripting and web application languages, such as PHP and Ruby.

The name XMLTP Light evokes the design goals and purposes of this protocol:

  • transport TP Light (see Note 1) remote procedure calls (RPC);

  • use an XML syntax to allow compatibility with various tools, extant or future;

  • have good performance, with modest requirements (therefore, real practical scalability using simple hardware configurations); and

  • have a lightweight and robust implementation.

To achieve its goal of being fast and lightweight, XMLTP/L does not try to do everything and acknowledges the following constraints and limitations as non-goals:

  • it uses a subset of XML (a full-fledge XML parser can parse XMLTP/L, but not the reverse);

  • it is not a universal transport for all datatypes;

  • it is not the most standardized or the most buzzwords-compliant technology to appear now, yesterday or tomorrow; and

  • it is not as easy to install as XML-RPC (which is less than 20KB of code in the Python 2.x standard installation).

XMLTP/L is much more boring than other XML protocols, such as SOAP or XML-RPC, or any newer web services protocols that will appear real-soon-now. But, if it fills someone's need for a non-proprietary, robust and flexible RPC protocol on their intranet, then that would be great.

XMLTP/L should be appealing to people who use stored procedure calls and want more flexibility (like integrating multiple DBMS or building a custom non-SQL dataserver). It works for people who need some level of XML compliance without suffering response time greater than 300 or 500 ms, as is the norm for all the implementations of RPC-over-XML that I have tried so far. I have not tried all of them but enough to convince me to write a custom XML subset parser in Bison and C.

Figure 1. XMLTP/L Server Interconnections

Figure 1 shows a schema of XMLTP/L server inteconnection. Only one dataserver is shown, but it is possible to put two or many behind the RRGX, a gateway that does RPC Routing (more about RRGX below). Also in Figure 1, you can see a small box called XML-RPC or SOAP bridge. This piece does not exist yet. But, it would be rather easy to create such a bridge, as the initial specs for XMLTP/L are similar to those for XML-RPC.

Technical Specifications

The current implementation of XMLTP/L uses plain TCP/IP socket for transport, and the connections are persistent. In other words, once a socket connection is established, it is used for many RPC calls. This situation is different from XML-RPC, where a new HTTP connection (a new TCP/IP connection) typically is made for each RPC call. This is one of the reasons why XMLTP/L is faster than XML-RPC.

Furthermore, the RRGX gateway and the Java classes (RPC call) use connection pools. This technique is another boost for performance when many threads of an application server make many RPC calls.

The lightness of the implementation is a third factor for good speed. XMLTP/L is a simple XML-based protocol, and it supports only four datatypes in the RPC parameters and in the tabular result set(s), which can be returned by the stored procedures. These datatypes are:

  • integer: 32 bits, most likely (Python allows for larger *int*, but XML2SYB uses ANSI "C" 32 bits *int* on GNU/Linux on x386)

  • float: double precision, most likely

  • timestamp: "YYYY-MM-DD HH:MM:SS.mmm"

  • string: max of 255 characters.

For any of these datatypes, a value can be flagged as *Null*. In this case, the data associated with that value should be sent empty by the server, and it should be ignored by the client.

If we look at all the details, a XMLTP/L response contains a return status (always), one or many result set(s) (optional), a messages(s) (optional) and an output parameter(s) (optional).

The XML syntax of both the RPC call and the response in XMLTP/L is lighter than the equivalent in XML-RPC. Here is an exampe of a RPC call in XMLTP/L:

<?xml version="1.0" encoding="ISO-8859-1" ?>
<procCall<proc>RPC_PROC_1</proc>
 <param>
  <name>@out_param</name
  <int>-9999</int>
  <attr>114</attr>
 </param>
 <param>
  <name>@param_1</name>
  <str>adenosine</str>
  <attr>0012</attr>
 </param>
</procCall>

Here's the same RPC call, using the XML-RPC syntax:

<?xml version='1.0'?>
<methodCall><methodName>RPC_PROC_1</methodName>
 <params>
  <param>
  <value>array><data>
   <value><string>@out_param</string></value>
   <value><int>-9999</int></value>
   <value><string>114</string></value>
  </data></array></value>
  </param>
  <param>
  <value><array><data>
   <value><string>@param_1</string></value>
   <value><string>adenosine</string></value>
   <value><string>0012</string></value>
  </data></array></value>
  </param>
 </params>
</methodCall>

The difference in the byte count is even more dramatic when the RPC call has more parameters and also in a response with a large number of rows.

A main design goal of XMLTP/L was to pump a high volume of transactions in and out of a relational database. We wanted some level of vendor independance, specifically so users would suffer less from the vendors' frequent upgrades to their native client-server APIs. Also, we wanted to be able to mix datasources and perform RPC routing in a way that would be invisible to the web application server or to other XMLTP/L clients connected to the RRGX.

XMLTP/L acts like a universal database RPC protocol. The whole intranet can use XMLTP/L for RPC calls, and vendors' proprietary protocols can be isolated on a server near the database itself. The RRGX (RPC Router Gateway for XMLTP/L) can integrate multiple database servers together as if if it was a single server. Be aware that XMLTP/L does not automate synchronisation of transactions on multiple dataservers; each RPC is independent of all other calls.

The RRGX does not use any proprietary client/server APIs. This RRGX allows calling procedures in multiple database servers as it routes the RPCs to the various converter programs, according to the names of the procedures. Those routing rules are defined in the configuration file of the RRGX.

The RRGX needs one converter per database protocol. Such converter programs are external programs built from the vendor's API libraries and the XMLTP/L modules. Currently, XML2SYB (XML to Sybase) is already implemented.

It also is possible to code stored procedures in Python in a server similar to the RRGX, which is derived from the generic gxserver.py module. Many other small features are in the RRGX that make it useful in a real production environment:

  • log of events and messages (error, warning, trace & debug)

  • dynamically adjustable trace level

  • built-in operator commands, such as ps, who, stats and so on

  • gates that can be closed while maintenance is done on the database(s) behind the gateway:

    • RPC gate (global)

    • connection pools gates

    • connection gate (global)

    • application specific gates (one gate for a list of RPC names).

    When a gate is closed, a user-defined message can be sent back to the clients.

  • RPC result interceptions (with passive queueing or active re-forwarding)

Limitations, Trade-offs and Advantages

The choice of only four simple datatypes shows that XMLTP/L does not attemp to solve many of the needs of typical multimedia applications. It probably would be quite easy to support a BLOB (binary large object) datatype, but this is not a priority at this moment.

Another limitation in this first implementation is the currently supported character set is ISO-8859-1 (Latin European languages). But, given the fact that this is a simple protocol (and the fact that the source code is available under the GNU LGPL license), we expect other developers to adapt it to their own cultural needs.

The XML syntax used in our project is much more compact than the one used by XML-RPC as of August 2001, when we started designing and building XMLTP/L. A typical RPC call stream or result stream is about half the size when it is encoded with XMLTP/L, because the tags in XMLTP/L are smaller and fewer.

Technical Details about the Current Implementation

We used four languages--GNU Bison, ANSI C, Python and Java--to do the current implementation. At this time, XMLTP/L can be used by programs written in three languages: Python, C and Java.

The main building blocks of the implementation are:

  • a grammar suitable for GNU Bison (in other words, a Yacc .y source file). This grammar is done in a way that allows a multithread parser.

  • a C/Python module that does parsing in a multithreaded way. That xmltp_gx.so module allows users to write both client and server programs in Python. Furthermore, it currently is used to build a gateway server, gxserver.py, that routes RPC calls according to their names. The gateway gxserver.py (or RRGX) has many other features, such as operator commands and RPC gates, that are not fully described here.

  • a C program, XML2SYB, that converts back and forth between XMLTP/L RPC calls to native Sybase RPC calls (TDS protocol). XML2SYB acts as a XMLTP/L server.

  • a Java JDBC driver (client side of XMLTP/L) that is suitable for use with Tomcat web applications or other Java programs.

Resources

XMLT/P Site

Essential Client/Server Survival Guide, Robert Orfali, Dan Harkey & Jeri Edwards. New York, John Wiley & Sons, Inc., 1994.

"Scale Up with TP Monitors", BYTE, April 1995, pp. 123-128.

Notes

*TP Light*: TP Light is doing light OLTP (on-line transaction processing) by using RPCs to call procedures stored in a SQL database. This could be a Sybase, Oracle, IBM DB2 or any other DBMS.

Jean-Francois Touchette has been developing software for 20 years. Since 1985, he has written several gateways and servers with various protocols. He has been using C and UNIX since that time. When Python is suitable, he prefers it.

__________________________


Special Magazine Offer -- 2 Free Trial Issues!
Receive 2 free trial issues of Linux Journal as well as instant online access to current and past issues. There's NO RISK and NO OBLIGATION to buy. CLICK HERE for offer

Linux Journal: delivering readers the advice and inspiration they need to get the most out of their Linux systems since 1994.

Sorry, offer available in the US only. International orders, click here.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.

Re: Slow XMLRPC implementations

On March 26th, 2003 Anonymous says:

Regarding the comments about slow XMLRPC calls: there is no reason the same approach (lightweight XML parser, persistent socket communications, say via HTTP/1.1) shouldn't provide the same response times within standard XMLRPC that you are getting with this system.

Check out XmlRpc++ for an example: http://xmlrpcpp.sourceforge.net

Re: Slow XMLRPC implementations

On April 6th, 2003 jftouchette (not verified) says:

Your point is valid, but, there are other features lacking from XML-RPC:

no login authentication (no "session"), no attribute (size) for a string

parameter or column... At some point, hacking/tuning an existing protocol until

it becomes incompatible with its existing implementations is a worse option

than creating yet another new "RPC" protocol. One should remember that

XMLTP/L first objective is to do fast transaction forwarding over a _LAN_

and that it has no ambition to replace SOAP or XML-RPC, which are

promoted for "web services" usage.

Featured Videos

Email is one of the least private and least secure forms of communication, although few people realize this. MixMaster is one way to allow secure, anonymous communication even over the very public medium of email. This tutorial will get you started with MixMaster quickly and easily.

In case you were wondering about the fun side of Linux World Expo, we thought we'd give you a peek at our shenanigans. We at Linux Journal love what we do so much, that we can't help but have a ball wherever we go.

From the Magazine

September 2008, #173

Feeling a bit like a Thermian? Never give up, never surrender! Someday, you could go from underdog to top dog. Just take a look at a few of the underdogs we highlight in this issue: Mutt, djbdns, Nginix, Gentoo, Xara and the program voted mostly likely to fail just a few years back—Firefox. If Firefox is not radical enough for you, check out Chef Marcel's column for some more alternatives. Having trouble mapping your program data to your relational database? If so, Rueven Lerner shows you some tricks in his At The Forge column.

Need to run GUI applications on your server in the next state? In his Paranoid Penguin column, Mick Bauer shows you how to do it securely. Kyle Rankin keeps hacking and slashing and shows you a few split screen secrets you may not be familiar with. Finally, we all know what happens next February, but only Doc knows what happens afterward.

Read this issue