Title : 
Selectivity estimation for string predicates: overcoming the underestimation problem
         
        
            Author : 
Chaudhuri, Surajit ; Ganti, Venkatesh ; Gravano, Luis
         
        
        
            fDate : 
30 March-2 April 2004
         
        
        
        
            Abstract : 
Queries with (equality or LIKE) selection predicates over string attributes are widely used in relational databases. However, state-of-the-art techniques for estimating selectivities of string predicates are often biased towards severely underestimating selectivities. We develop accurate selectivity estimators for string predicates that adapt to data and query characteristics, and which can exploit and build on a variety of existing estimators. A thorough experimental evaluation over real data sets demonstrates the resilience of our estimators to variations in both data and query characteristics.
         
        
            Keywords : 
query processing; relational databases; data characteristics; query characteristics; real data set; relational databases; selection predicate; selectivity estimation; state-of-the-art technique; string predicates; underestimation problem; Frequency estimation; Relational databases; Resilience; State estimation; Statistics;
         
        
        
        
            Conference_Titel : 
Data Engineering, 2004. Proceedings. 20th International Conference on
         
        
        
            Print_ISBN : 
0-7695-2065-0
         
        
        
            DOI : 
10.1109/ICDE.2004.1319999