Title :
Optimizing selections over datacubes
Author :
Ross, Kenneth A. ; Zaman, Kazi A.
Author_Institution :
Columbia Univ., USA
Abstract :
Datacube queries compute aggregates over database relations at a variety of granularities. Often one wants only datacube output tuples whose aggregate value satisfies a certain condition, such as exceeding a given threshold. We develop algorithms for processing a datacube query using the selection condition internally during the computation. Thus, we can safely prune parts of the computation and end up with a more efficient computation of the answer Our first technique, called “specialization”, uses the fact that a tuple in the datacube does not meet the given threshold to infer that all finer level aggregates cannot meet the threshold. Our second technique is called “generalization”, and applies in the case where the actual value of the aggregate is not needed in the output, but used just to compare with the threshold. We demonstrate the efficiency of these techniques by implementing them within the sparse datacube algorithm of Ross and Srivastava. We present a performance study using synthetic and real-world data sets. Our results indicate substantial performance improvements for queries with selective conditions
Keywords :
query processing; aggregates; database relations; datacube queries optimisation; granularities; performance improvements; Aggregates; Databases; Marketing and sales; Medical treatment;
Conference_Titel :
Scientific and Statistical Database Management, 2000. Proceedings. 12th International Conference on
Conference_Location :
Berlin
Print_ISBN :
0-7695-0686-0
DOI :
10.1109/SSDM.2000.869784