Title :
Selectivity estimation for string predicates: overcoming the underestimation problem
Author :
Chaudhuri, Surajit ; Ganti, Venkatesh ; Gravano, Luis
fDate :
30 March-2 April 2004
Abstract :
Queries with (equality or LIKE) selection predicates over string attributes are widely used in relational databases. However, state-of-the-art techniques for estimating selectivities of string predicates are often biased towards severely underestimating selectivities. We develop accurate selectivity estimators for string predicates that adapt to data and query characteristics, and which can exploit and build on a variety of existing estimators. A thorough experimental evaluation over real data sets demonstrates the resilience of our estimators to variations in both data and query characteristics.
Keywords :
query processing; relational databases; data characteristics; query characteristics; real data set; relational databases; selection predicate; selectivity estimation; state-of-the-art technique; string predicates; underestimation problem; Frequency estimation; Relational databases; Resilience; State estimation; Statistics;
Conference_Titel :
Data Engineering, 2004. Proceedings. 20th International Conference on
Print_ISBN :
0-7695-2065-0
DOI :
10.1109/ICDE.2004.1319999