DocumentCode
3335016
Title
A general and efficient algorithm for “top” queries
Author
Graefe, Goetz
Author_Institution
Hewlett-Packard Labs., Palo Alto, CA
fYear
2008
fDate
7-12 April 2008
Firstpage
548
Lastpage
555
Abstract
"Top K" queries reduce a query result to the most interesting or the most urgent items. In many cases, e.g., when the result size is unbounded due to duplicate key values, a "top" operation cannot be implemented using the standard algorithm based on an in-memory priority queue. The default alternative is a full sort. External merge sort admits multiple novel optimizations specific to "top" operations. These are simple to implement yet greatly reduce the data volume written to runs on temporary storage. Experiments demonstrate substantial performance improvements, in one case exceeding three orders of magnitude.
Keywords
data structures; merging; query processing; sorting; duplicate key values; in-memory priority queue; merge sort; top K query optimizations; Computer aided software engineering; Data mining; Gold; Laboratories; Medals; Memory management; Query processing; Robustness; Scalability; Sorting;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on
Conference_Location
Cancun
Print_ISBN
978-1-4244-2161-9
Electronic_ISBN
978-1-4244-2162-6
Type
conf
DOI
10.1109/ICDEW.2008.4498379
Filename
4498379
Link To Document