Title :
Histogram methods in query optimization: the relation between accuracy and optimality
Author :
Oommen, B. John ; Rueda, Luis G.
Author_Institution :
Sch. of Comput. Sci., Carleton Univ., Ottawa, Ont., Canada
Abstract :
We have solved the following problem using pattern classification techniques (PCT): given two histogram methods, M/sub 1/ and M/sub 2/, used in query optimization, if the estimation accuracy of M/sub 1/ is greater than that of M/sub 2/, then M/sub 1/ has a higher probability of leading to the optimal query evaluation plan (QEP) than M/sub 2/. To the best of our knowledge, this problem has been open for at least two decades, the difficulty of the problem partially being due to the hurdles involved in the formulation itself. By formulating the problem from a pattern recognition perspective, we use PCT to present a rigorous mathematical proof of this fact, and show some uniqueness results. We also report empirical results demonstrating the power of these theoretical results on well-known histogram estimation methods.
Keywords :
database theory; estimation theory; graphs; optimisation; pattern classification; probability; query processing; estimation accuracy; histogram methods; optimal query evaluation plan; optimality; pattern classification techniques; pattern recognition; probability; query optimization; rigorous mathematical proof; uniqueness results; Books; Computer science; Councils; Database systems; Failure analysis; Histograms; NP-hard problem; Pattern recognition; Query processing;
Conference_Titel :
Database Systems for Advanced Applications, 2001. Proceedings. Seventh International Conference on
Conference_Location :
Hong Kong, China
Print_ISBN :
0-7695-0996-7
DOI :
10.1109/DASFAA.2001.916393