DocumentCode :
2120963
Title :
InterJoin: Exploiting Indexes and Materialized Views in XPath Evaluation
Author :
Phillips, Derek ; Zhang, Ning ; Ilyas, Ihab F. ; Özsu, M. Tamer
Author_Institution :
Waterloo Univ., Ont.
fYear :
2006
fDate :
0-0 0
Firstpage :
13
Lastpage :
22
Abstract :
XML has become the standard for data exchange in a wide variety of applications, particularly in the scientific community. To process queries on XML representations of scientific data efficiently, we require specialized techniques for evaluating XPath expressions. Exploiting materialized views significantly enhances query processing performance. We propose a novel view definition that allows intermediate (structural) join results to be stored and reused in XML query evaluation. Unlike current XML view proposals, our views do not require navigation in the original document or path-based pattern matching. Hence, they are evaluated significantly faster and are easily costed as part of a query plan. In general, current structural joins cannot exploit views efficiently when the view definition is not a prefix (or a suffix) of the XPath query. To increase the applicability of our proposed view definition, we propose a novel physical structural join operator called InterJoin. The InterJoin operator allows for joining interleaving XPath expressions, e.g., joining //A//C with //B to evaluate //A//B//C. InterJoin allows for more join alternatives in XML query plans. We propose several physical implementations for InterJoin, including a technique to exploit spatial indexes on the inputs. We give analytic cost models for the implementations so they can be costed in an existing XML query optimizer. Experiments on real and synthetic XML data show significant speed-ups of up to 200% using InterJoin, and speed-ups of up to 400% using our materialized views.
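The interleaving join mentioned in the abstract (joining //A//C with //B to evaluate //A//B//C) can be illustrated with a minimal sketch. It assumes a region (start, end) encoding of elements, under which one element is an ancestor of another exactly when its interval strictly contains the other's. The names `Node`, `contains`, and `inter_join` are hypothetical, and the nested-loop strategy is a naive stand-in for illustration only; it is not one of the physical implementations proposed in the paper.

```python
from typing import List, NamedTuple, Tuple


class Node(NamedTuple):
    """An XML element under an assumed region encoding (hypothetical type)."""
    tag: str
    start: int  # position of the opening tag in document order
    end: int    # position of the closing tag in document order


def contains(outer: Node, inner: Node) -> bool:
    """True if `outer` is a proper ancestor of `inner`."""
    return outer.start < inner.start and inner.end < outer.end


def inter_join(
    ac_pairs: List[Tuple[Node, Node]], b_nodes: List[Node]
) -> List[Tuple[Node, Node, Node]]:
    """Naive nested-loop interleaving join (illustrative assumption).

    `ac_pairs` plays the role of a materialized //A//C result and
    `b_nodes` the //B input. A triple (a, b, c) is emitted when b lies
    between a and c on the ancestor path, i.e. a contains b and b contains c.
    """
    return [
        (a, b, c)
        for a, c in ac_pairs
        for b in b_nodes
        if contains(a, b) and contains(b, c)
    ]


# Sample document: <A><B><C/></B><C/></A>
a = Node("A", 1, 8)
b = Node("B", 2, 5)
c1 = Node("C", 3, 4)  # nested inside B
c2 = Node("C", 6, 7)  # a child of A, outside B

result = inter_join([(a, c1), (a, c2)], [b])
```

In this sample, only the pair ending in `c1` has a B element between A and C, so `result` holds the single triple `(a, b, c1)`. No navigation of the original document is needed: the join works entirely on the stored intervals.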
Keywords :
XML; database indexing; query processing; scientific information systems; tree data structures; InterJoin; XML query evaluation; XML query optimizer; XPath evaluation; analytic cost models; data exchange; join operator; materialized views; path-based pattern matching; query processing; scientific community; spatial indexes; Application software; Books; Computer science; Interleaved codes; Navigation; Pattern matching; Proposals; Query processing; Spatial indexes; XML;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Scientific and Statistical Database Management, 2006. 18th International Conference on
Conference_Location :
Vienna
ISSN :
1551-6393
Print_ISBN :
0-7695-2590-3
Type :
conf
DOI :
10.1109/SSDBM.2006.31
Filename :
1644293