Title :
XSEED: Accurate and Fast Cardinality Estimation for XPath Queries
Author :
Zhang, Ning ; Özsu, M. Tamer ; Aboulnaga, Ashraf ; Ilyas, Ihab F.
Author_Institution :
University of Waterloo
Abstract :
We propose XSEED, a synopsis of path queries for cardinality estimation that is accurate, robust, efficient, and adaptive to memory budgets. XSEED starts from a very small kernel, and then incrementally updates information of the synopsis. With such an incremental construction, a synopsis structure can be dynamically configured to accommodate different memory budgets. Cardinality estimation based on XSEED can be performed very efficiently and accurately. Extensive experiments on both synthetic and real data sets show that even with less memory, XSEED could achieve accuracy that is an order of magnitude better than that of other synopsis structures. The cardinality estimation time is under 2% of the actual querying time for a wide range of queries in all test cases.
Keywords :
Computer science; Cost function; Histograms; Kernel; Relational databases; Robustness; Statistical distributions; Testing; Tree graphs; XML;
Conference_Titel :
Data Engineering, 2006. ICDE '06. Proceedings of the 22nd International Conference on
Print_ISBN :
0-7695-2570-9
DOI :
10.1109/ICDE.2006.178