Title :
A Table-Driven Streaming XML Parsing Methodology for High-Performance Web Services
Author :
Zhang, Wei ; van Engelen, R.
Author_Institution :
Dept. of Comput. Sci., Florida State Univ., Tallahassee, FL
Abstract :
This paper presents a table-driven streaming XML parsing methodology, called TDX. TDX expedites XML parsing by pre-recording the states of an XML parser in tabular form and by utilizing an efficient runtime streaming parsing engine based on a push-down automaton. The parsing tables are automatically produced from the XML schemas of a WSDL service description. Because the schema constraints are pre-encoded in a parsing table, the approach effectively implements a schema-specific XML parsing technique that combines parsing and validation into a single pass. This significantly increases the performance of XML Web services, which results in better response time and may reduce the impact of the flash-crowd effect. To implement TDX, we developed a parser construction toolkit to automatically construct parsers in C code from WSDLs and XML schemas. We applied the toolkit to an example Web services application and measured the raw performance compared to popular high-performance parsers written in C/C++, such as eXpat, gSOAP, and Xerces. The performance results show that TDX can be an order of magnitude faster
Keywords :
Web services; XML; grammars; pushdown automata; WSDL service description; high-performance Web services; parser construction toolkit; push-down automaton; runtime streaming parsing engine; table-driven streaming XML parsing; Automata; Computer science; Delay; Encoding; Engines; Filters; Production; Runtime; Web services; XML;
Conference_Titel :
Web Services, 2006. ICWS '06. International Conference on
Conference_Location :
Chicago, IL
Print_ISBN :
0-7695-2669-1
DOI :
10.1109/ICWS.2006.15