• DocumentCode
    533012
  • Title

    Querying XML data based on improved prefix encoding

  • Author

    Yang, Yang ; Hai-Ge, Li ; Xin, He

  • Author_Institution
    Comput. Center, Henan Univ., Kaifeng, China
  • Volume
    10
  • fYear
    2010
  • fDate
    22-24 Oct. 2010
  • Abstract
    XML has become the de facto standard of data presentation and exchange on the Web and Internet. Efficiently store and query XML documents have been the primary urgent task. At present, most of XML index and query are based on encoding the XML document tree. XML encoding is that node or edge of XML document tree is encoded with the only encoding according to some method. Through the encoding, structural relation can be judged and the XML document tree needn´t be traversed. So XML structure query is often transformed into structure join calculation. In order to support query especially structure query, all kinds of XML encoding schemes are proposed. And most of XML query are based on prefix encoding schemes. Prefix encoding is based on path and completely keeps the absolute path from root to target node, then it is convenient to judge structural relation. This paper analyzes and studies some existing prefix encoding schemes, which have long codes when node depth is excessive deep and the document tree has more children or more siblings. Then excessively long codes need extra memory space and matching of structural join. So as to enhance encoding performance and query efficiency, and to lessen encoding length, therefore, by analyzing the current prefix encoding schemes, IPE encoding (Improved Prefix Encoding) is proposes in this paper. IPE encoding is an improved XML prefix encoding. With this encoding, we can reduce the length of prefix encoding and conveniently judge structural relation, when XML document tree has more nodes. Based on IPE encoding and inverted index tables, path query algorithm is presented; accordingly query speed is improved effectively.
  • Keywords
    Internet; XML; data structures; electronic data interchange; encoding; indexing; query processing; Internet; World Wide Web; XML data querying; XML document querying; XML document tree encoding; XML index; data exchange; data presentation; improved prefix encoding; inverted index tables; path query algorithm; Encoding; XML; IPE Encoding; Path Query; Prefix Encoding; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Application and System Modeling (ICCASM), 2010 International Conference on
  • Conference_Location
    Taiyuan
  • Print_ISBN
    978-1-4244-7235-2
  • Electronic_ISBN
    978-1-4244-7237-6
  • Type

    conf

  • DOI
    10.1109/ICCASM.2010.5622681
  • Filename
    5622681