• DocumentCode
    2210091
  • Title

    A stream-based selectivity estimation technique for forward XPath

  • Author

    Alrammal, Muath ; Hains, Gaétan

  • Author_Institution
    LACL (Lab. d´´Algorithmique, Complexite et Logique), Univ. Paris-Est, Marne-la-Vallée, France
  • fYear
    2012
  • fDate
    18-20 March 2012
  • Firstpage
    209
  • Lastpage
    214
  • Abstract
    The Extensible Markup Language (XML) rapidly establishes itself as the de facto standard for presenting, storing, and exchanging data on the Internet. However, querying large volume of XML data represents a bottleneck for several computationally intensive applications. A fast and accurate selectivity estimation mechanism is of practical importance because selectivity estimation plays a fundamental role in XML query performance. Recently proposed techniques are all based on some forms of structure synopses that could be time-consuming to build and not effective for summarizing complex structure relationships. To overcome this limitation, we propose an innovative selectivity estimation algorithm, which consists of (1) the path tree synopsis data structure, a succinct description of the original document with low computational overhead and high accuracy for processing tasks like selectivity estimation, (2) a streaming selectivity estimation algorithm which is efficient for path tree traversal. Extensive experiments on both real and synthetic data sets show that our technique achieves better accuracy and less construction time than existing approaches.
  • Keywords
    Internet; XML; electronic data interchange; query processing; tree data structures; Internet; complex structure; data exchange; data presentation; data set; data storage; de facto standard; extensible markup language; forward XPath; path tree synopsis data structure; path tree traversal; query processing; streaming selectivity estimation algorithm; task processing; Accuracy; Data structures; Estimation; Grammar; Impedance matching; Internet; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovations in Information Technology (IIT), 2012 International Conference on
  • Conference_Location
    Abu Dhabi
  • Print_ISBN
    978-1-4673-1100-7
  • Type

    conf

  • DOI
    10.1109/INNOVATIONS.2012.6207734
  • Filename
    6207734