• DocumentCode
    2924955
  • Title
    A Table-Driven Streaming XML Parsing Methodology for High-Performance Web Services
  • Author
    Zhang, Wei; van Engelen, R.
  • Author_Institution
    Dept. of Comput. Sci., Florida State Univ., Tallahassee, FL
  • fYear
    2006
  • fDate
    18-22 Sept. 2006
  • Firstpage
    197
  • Lastpage
    204
  • Abstract
    This paper presents a table-driven streaming XML parsing methodology, called TDX. TDX expedites XML parsing by pre-recording the states of an XML parser in tabular form and by utilizing an efficient runtime streaming parsing engine based on a push-down automaton. The parsing tables are automatically produced from the XML schemas of a WSDL service description. Because the schema constraints are pre-encoded in a parsing table, the approach effectively implements a schema-specific XML parsing technique that combines parsing and validation into a single pass. This significantly increases the performance of XML Web services, which results in better response time and may reduce the impact of the flash-crowd effect. To implement TDX, we developed a parser construction toolkit to automatically construct parsers in C code from WSDLs and XML schemas. We applied the toolkit to an example Web services application and measured the raw performance compared to popular high-performance parsers written in C/C++, such as eXpat, gSOAP, and Xerces. The performance results show that TDX can be an order of magnitude faster
  • Keywords
    Web services; XML; grammars; pushdown automata; WSDL service description; high-performance Web services; parser construction toolkit; push-down automaton; runtime streaming parsing engine; table-driven streaming XML parsing; Automata; Computer science; Delay; Encoding; Engines; Filters; Production; Runtime
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Web Services, 2006. ICWS '06. International Conference on
  • Conference_Location
    Chicago, IL
  • Print_ISBN
    0-7695-2669-1
  • Type
    conf
  • DOI
    10.1109/ICWS.2006.15
  • Filename
    4032028
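
The abstract describes schema constraints pre-encoded in a parsing table and a runtime engine based on a push-down automaton, so that parsing and validation happen in one pass. The following is a minimal sketch of that general idea only, not the TDX toolkit or its generated code. The schema (an `order` element containing one `id` followed by one or more `item` elements), the table layout, and the function name `tdx_sketch_validate` are all hypothetical, and the scanner handles only bare start and end tags (no attributes, comments, or self-closing tags).

```c
#include <string.h>

/* Hypothetical schema: <order> holds one <id>, then one or more <item>. */
enum { T_ORDER, T_ID, T_ITEM, N_TAGS };
enum { N_STATES = 7, S_DONE = 6, MAX_DEPTH = 16, ERR = -1 };

static const char *tag_names[N_TAGS] = { "order", "id", "item" };

/* Transition table written by hand here; the abstract says TDX generates
 * its tables from XML schemas. Column index is 2*tag + is_close. */
static const int next_state[N_STATES][2 * N_TAGS] = {
    /*        <order> </order>  <id>  </id> <item> </item> */
    /* 0 */ {    1,    ERR,    ERR,   ERR,   ERR,   ERR },
    /* 1 */ {  ERR,    ERR,      2,   ERR,   ERR,   ERR },
    /* 2 */ {  ERR,    ERR,    ERR,     3,   ERR,   ERR },
    /* 3 */ {  ERR,    ERR,    ERR,   ERR,     4,   ERR },
    /* 4 */ {  ERR,    ERR,    ERR,   ERR,   ERR,     5 },
    /* 5 */ {  ERR,      6,    ERR,   ERR,     4,   ERR },
    /* 6 */ {  ERR,    ERR,    ERR,   ERR,   ERR,   ERR },
};

/* Map a tag name of length n to its id, or ERR if the schema lacks it. */
static int lookup_tag(const char *s, size_t n) {
    for (int t = 0; t < N_TAGS; t++)
        if (strlen(tag_names[t]) == n && strncmp(tag_names[t], s, n) == 0)
            return t;
    return ERR;
}

/* Single pass over the input: returns 1 if the document is well nested
 * and matches the hypothetical schema, 0 otherwise. Character data
 * between tags is skipped, not validated. */
int tdx_sketch_validate(const char *xml) {
    int stack[MAX_DEPTH], depth = 0, state = 0;
    const char *p = xml;

    while ((p = strchr(p, '<')) != NULL) {
        int is_close = (p[1] == '/');
        const char *name = p + 1 + is_close;
        const char *end = strchr(name, '>');
        if (end == NULL) return 0;                  /* unterminated tag */

        int tag = lookup_tag(name, (size_t)(end - name));
        if (tag == ERR) return 0;                   /* unknown element */

        if (is_close) {
            /* Pop: the end tag must match the innermost open element. */
            if (depth == 0 || stack[--depth] != tag) return 0;
        } else {
            if (depth == MAX_DEPTH) return 0;       /* nesting too deep */
            stack[depth++] = tag;                   /* push open element */
        }

        /* Table lookup performs the schema check for this token. */
        state = next_state[state][2 * tag + is_close];
        if (state == ERR) return 0;
        p = end + 1;
    }
    return state == S_DONE && depth == 0;
}
```

For this small non-recursive schema the table alone would already reject mismatched tags; the explicit stack is kept because it is what lets a push-down automaton track nesting in general. Under these assumptions, `tdx_sketch_validate("<order><id>7</id><item>a</item></order>")` accepts the input, while a document that omits `id` or `item` is rejected at the first offending tag.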