Title :
Greedy List Intersection
Author :
Krauthgamer, Robert ; Mehta, Aranyak ; Raman, Vijayshankar ; Rudra, Atri
Author_Institution :
Weizmann Inst., Rehovot, Israel & IBM Almaden
Abstract :
A common technique for processing conjunctive queries is to first match each predicate separately using an index lookup, and then compute the intersection of the resulting row-id lists, via an AND-tree. The performance of this technique depends crucially on the order of lists in this tree: it is important to compute early the intersections that will produce small results. But this optimization is hard to do when the data or predicates have correlation. We present a new algorithm for ordering the lists in an AND-tree by sampling the intermediate intersection sizes. We prove that our algorithm is near-optimal and validate its effectiveness experimentally on datasets with a variety of distributions.
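The ordering idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm, only a simplified greedy loop under stated assumptions: row-id lists are held as in-memory Python sets rather than index lookups feeding an AND-tree, and the names `estimate_survival`, `greedy_intersection`, `sample_size`, and `seed` are hypothetical. At each step it samples the current intermediate result and applies next the list that the sample suggests will shrink it the most.

```python
import random


def estimate_survival(current, candidate, sample_size, rng):
    """Estimate the fraction of `current` row ids that also occur in `candidate`."""
    if not current:
        return 0.0
    pool = sorted(current)  # sorted so sampling is reproducible for a fixed seed
    sample = pool if len(pool) <= sample_size else rng.sample(pool, sample_size)
    return sum(1 for row_id in sample if row_id in candidate) / len(sample)


def greedy_intersection(lists, sample_size=64, seed=0):
    """Intersect row-id lists, choosing each next list by sampled selectivity.

    Returns (sorted intersection, order in which list indices were applied).
    This is an illustrative assumption, not the method from the paper.
    """
    rng = random.Random(seed)
    sets = [set(row_ids) for row_ids in lists]
    if not sets:
        return [], []

    # Start from the shortest list (ties go to the lowest index).
    first = min(range(len(sets)), key=lambda i: len(sets[i]))
    current = set(sets[first])
    order = [first]
    remaining = set(range(len(sets))) - {first}

    # Stop early if the intermediate result is already empty.
    while remaining and current:
        # Pick the list whose sampled intersection with `current` is smallest.
        best = min(
            sorted(remaining),
            key=lambda i: estimate_survival(current, sets[i], sample_size, rng),
        )
        current &= sets[best]
        order.append(best)
        remaining.remove(best)

    return sorted(current), order


if __name__ == "__main__":
    a = range(100)        # predicate A matches rows 0..99
    b = range(100)        # predicate B is perfectly correlated with A
    c = range(90, 200)    # predicate C overlaps A only on rows 90..99
    result, order = greedy_intersection([a, b, c])
    print(result, order)
```

In this toy input, lists `a` and `b` are perfectly correlated, so intersecting them first would not shrink the intermediate result at all. Because the sketch samples the intermediate result instead of assuming independent predicates, it applies `c` before `b`.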
Keywords :
query processing; tree data structures; conjunctive queries; greedy list intersection; index lookup; Database systems; Entropy; Explosions; Histograms; Indexes; Intrusion detection; Multidimensional systems; Query processing; Sampling methods; Statistics;
Conference_Title :
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-1836-7
Electronic_ISBN :
978-1-4244-1837-4
DOI :
10.1109/ICDE.2008.4497512