• DocumentCode
    1333155
  • Title

    A Prototype for Querying over LZCS Transformed Documents

  • Author

    Adiego, J. ; Navarro, G. ; de la Fuente, P.

  • Author_Institution
    Dept. de Inf., Univ. de Valladolid, Valladolid, Spain
  • Volume
    7
  • Issue
    3
  • fYear
    2009
  • fDate
    7/1/2009 12:00:00 AM
  • Firstpage
    353
  • Lastpage
    360
  • Abstract
    We present novel query algorithms that efficiently support some popular XPath operations over LZCS-transformed documents. The LZCS transformation compresses a redundant XML collection without loss. The main idea of LZCS, inspired by Lempel-Ziv compression, is to replace whole substructures by previous occurrences thereof, and our algorithms try to reuse the work done over those repeating substructures. The algorithms are implemented in a prototype called lzcs-grep. The main advantage of lzcs-grep is that it processes the documents in transformed form, obtaining very fast response times in combination with low memory requirements. Our experimental results show that lzcs-grep is competitive with other XPath processors even over untransformed documents and by far unbeaten when it can operate over their LZCS-transformed version.
  • Keywords
    XML; data compression; query processing; LZCS; Lempel-Ziv compression transformed document querying; XML collection; XPath operation; extensible markup language; lzcs-grep prototype; Data compression; Database languages; Delay; Prototypes; Query processing; Visualization; XML; Data Compression; Database Query Processing; Query Languages;
  • fLanguage
    English
  • Journal_Title
    Latin America Transactions, IEEE (Revista IEEE America Latina)
  • Publisher
    ieee
  • ISSN
    1548-0992
  • Type

    jour

  • DOI
    10.1109/TLA.2009.5336634
  • Filename
    5336634