• DocumentCode
    2955206
  • Title

    A performance model for Forward XPath

  • Author

    Alrammal, Muath ; Hains, Gaétan

  • Author_Institution
    LACL (Lab. d'Algorithmique, Complexité et Logique), Univ. Paris-Est, Orsay, France
  • fYear
    2012
  • fDate
    2-6 July 2012
  • Firstpage
    595
  • Lastpage
    601
  • Abstract
    XML is a key standard for manipulating data on the Internet. However, querying large volumes of XML data is a bottleneck for several data-intensive applications. Many modern applications require processing of massive streams of XML data, creating difficult technical challenges. Among these are the optimization of XPath query processing and accurate cost estimation for these queries when processed on a massive stream of XML data. In this paper, we present a novel performance prediction model which a priori estimates the cost of any Forward XPath structural query in terms of space used and time spent. The model consists of (1) a lazy stream-querying algorithm, LQ, (2) a mathematical performance model (linear regression functions), and (3) a new selectivity estimation technique. Extensive experiments on both real and synthetic data sets show that our model achieves accuracy better than existing approaches. The resulting prototype supports the a priori design of efficient queries, as well as automatic query optimizations.
  • Keywords
    Internet; XML; query processing; LQ; XML data; data manipulation; forward XPath; query optimizations; stream-querying algorithm; Accuracy; Algorithm design and analysis; Estimation; Mathematical model; Prediction algorithms; Predictive models
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High Performance Computing and Simulation (HPCS), 2012 International Conference on
  • Conference_Location
    Madrid
  • Print_ISBN
    978-1-4673-2359-8
  • Type

    conf

  • DOI
    10.1109/HPCSim.2012.6266979
  • Filename
    6266979