DocumentCode
3269883
Title
A flexible infrastructure for gathering XML statistics and estimating query cardinality
Author
Freire, Juliana ; Ramanath, Maya ; Zhang, Lingzhi
fYear
2004
fDate
30 March-2 April 2004
Firstpage
857
Abstract
A key component of XML data management systems is the result size estimator, which estimates the cardinalities of user queries. Estimated cardinalities are needed in a variety of tasks, including query optimization and cost-based storage design, and they can also give users early feedback about the expected outcome of their queries. In contrast to previously proposed result estimators, which use specialized data structures and estimation algorithms, StatiX uses histograms to uniformly capture both the structural and value skew present in documents. The original version of StatiX was built as a proof of concept. With the goal of making the system publicly available, we have built StatiX++, a new and improved version of StatiX that extends the original system in significant ways. In this demonstration, we show the key features of StatiX++.
Keywords
XML; data structures; query processing; statistical databases; StatiX++ system; XML data management systems; XML statistics; cost-based storage design; estimation algorithms; histograms; publicly available system; query cardinality estimation; query optimization; result size estimator; Statistics
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Engineering, 2004. Proceedings. 20th International Conference on
ISSN
1063-6382
Print_ISBN
0-7695-2065-0
Type
conf
DOI
10.1109/ICDE.2004.1320085
Filename
1320085