• DocumentCode
    2831380
  • Title

    Efficient Compression and Querying of XML Repositories

  • Author

    Alkhatib, Ramez ; Scholl, Marc H.

  • Author_Institution
    Univ. of Konstanz, Konstanz
  • fYear
    2008
  • fDate
    1-5 Sept. 2008
  • Firstpage
    365
  • Lastpage
    369
  • Abstract
    With the rapidly increasing popularity of XML as a data format, there is a large demand for efficient techniques in storing and querying XML documents. However XML is by nature verbose, due to repeatedly used tags that describe data. For this reason the storage requirements of XML can be excessive and lead to increased costs for data manipulation. Therefore, it seems natural to use compression techniques to increase the efficiency of storing and querying XML data. In this paper, we propose a new approach called SCQX for Storing, Compressing and Querying XML documents. This approach compresses the structure of an XML document based on exploiting repetitive consecutive tags in the structure, and then SCQX stores the compressed XML structure and the data separately in a robust storage structure that includes a set of access support structures to guarantee fast query performance. Moreover, SCQX supports querying of the compressed XML structure directly and efficiently without requiring decompression. An experimental evaluation on sets of XML data shows the effectiveness of our approach.
  • Keywords
    XML; data compression; query processing; XML document querying; XML document storing; XML repositories; data manipulation; storage requirements; Costs; Data structures; Expert systems; Labeling; Query processing; Relational databases; Robustness; Skeleton; Testing; XML; Compact Storage; Compressing; Encoding; Quering; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Application, 2008. DEXA '08. 19th International Workshop on
  • Conference_Location
    Turin
  • ISSN
    1529-4188
  • Print_ISBN
    978-0-7695-3299-8
  • Type

    conf

  • DOI
    10.1109/DEXA.2008.64
  • Filename
    4624743