DocumentCode
2955206
Title
A performance model for Forward XPath
Author
Alrammal, Muath ; Hains, Gaétan
Author_Institution
LACL (Lab. d´´Algorithmique, Complexite et Logique), Univ. Paris-Est, Orsay, France
fYear
2012
fDate
2-6 July 2012
Firstpage
595
Lastpage
601
Abstract
XML is a key standard for manipulating data on the Internet. However, querying large volume of XML data represents a bottleneck for several data intensive applications. Many modern applications require processing of massive streams of XML data, creating difficult technical challenges. Among these is the optimization of XPath query processing and accurate cost estimation for these queries when processed on a massive steam of XML data. In this paper, we present a novel performance prediction model which a priori estimates the cost of any Forward XPath structural in terms of space used and time spent. The model consists of (1) a lazy stream-querying algorithm LQ (2) a mathematical performance model (linear regression functions), and (3) a new selectivity estimation technique. Extensive experiments on both real and synthetic data sets show that our model achieves accuracy better than existing approaches. The resulting prototype supports the a priori design of efficient queries, as well as automatic query optimizations.
Keywords
Internet; XML; query processing; Internet; LQ; XML data; data manipulation; forward XPath; query optimizations; query processing; stream-querying algorithm; Accuracy; Algorithm design and analysis; Estimation; Mathematical model; Prediction algorithms; Predictive models; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing and Simulation (HPCS), 2012 International Conference on
Conference_Location
Madrid
Print_ISBN
978-1-4673-2359-8
Type
conf
DOI
10.1109/HPCSim.2012.6266979
Filename
6266979
Link To Document