DocumentCode :
2506623
Title :
Navigation- vs. index-based XML multi-query processing
Author :
Bruno, Nicolas ; Gravano, Luis ; Koudas, Nick ; Srivastava, Divesh
Author_Institution :
Columbia Univ., USA
fYear :
2003
fDate :
5-8 March 2003
Firstpage :
139
Lastpage :
150
Abstract :
XML path queries form the basis of complex filtering of XML data. Most current XML path query processing techniques can be divided in two groups. Navigation-based algorithms compute results by analyzing an input document one tag at a time. In contrast, index-based algorithms take advantage of precomputed numbering schemes over the input XML document. We introduce a new index-based technique, index-filter, to answer multiple XML path queries. Index-filter uses indexes built over the document tags to avoid processing large portions of the input document that are guaranteed not to be part of any match. We analyze index-filter and compare it against Y-filter, a state-of-the-art navigation-based technique. We show that both techniques have their advantages, and we discuss the scenarios under which each technique is superior to the other one. In particular, we show that while most XML path query processing techniques work off SAX events, in some cases it pays off to preprocess the input document, augmenting it with auxiliary information that can be used to evaluate the queries faster. We present experimental results over real and synthetic XML documents that validate our claims.
Keywords :
XML; database indexing; information filters; query languages; query processing; SAX events; XML data filtering; XML document; XML path queries; Y-filter; document tags; index-based XML multiquery processing; index-based algorithms; index-filter technique; navigation-based algorithms; numbering schemes; Algorithm design and analysis; Books; Data preprocessing; Database languages; Filtering; Impedance matching; Indexes; Navigation; Query processing; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2003. Proceedings. 19th International Conference on
Print_ISBN :
0-7803-7665-X
Type :
conf
DOI :
10.1109/ICDE.2003.1260788
Filename :
1260788
Link To Document :
بازگشت