DocumentCode
1447045
Title
Continuous Top-k Dominating Queries
Author
Kontaki, Maria ; Papadopoulos, Apostolos N. ; Manolopoulos, Yannis
Author_Institution
Dept. of Inf., Aristotle Univ., Thessaloniki, Greece
Volume
24
Issue
5
fYear
2012
fDate
5/1/2012 12:00:00 AM
Firstpage
840
Lastpage
853
Abstract
Top-k dominating queries use an intuitive scoring function which ranks multidimensional points with respect to their dominance power, i.e., the number of points that a point dominates. The k points with the best (e.g., highest) scores are returned to the user. Both top-k and skyline queries have been studied in a streaming environment, where changes to the data set are very frequent. In such an environment, continuous query processing techniques are required toward efficient monitoring of query results, since periodic query re-execution is computationally intensive, and therefore, prohibitive. This work contains the first study of continuous top-k dominating queries over data streams. In comparison to continuous top-k and skyline queries, continuous top-k dominating queries pose additional challenges. Three exact algorithms (BFA, EVA, ADA) are studied, and among them ADA, which is enhanced with additional optimization techniques, shows the best overall performance. In some cases, we are willing to trade accuracy for speed. Toward this direction, two approximate algorithms are proposed (AHBA and AMSA). AHBA offers probabilistic guarantees regarding the accuracy of the result based on the Hoeffding bound, whereas AMSA performs a more aggressive computation resulting in more efficient processing. Evaluation results, based on real-life and synthetic data sets, show the efficiency and scalability of our techniques.
Keywords
approximation theory; optimisation; probability; query processing; ADA algorithm; AHBA approximate algorithm; AMSA approximate algorithm; BFA algorithm; EVA algorithm; Hoeffding bound; continuous query processing technique; continuous top-k dominating query; data stream; intuitive scoring function; multidimensional point ranking; optimization technique; periodic query reexecution; probabilistic guarantee; query result monitoring; skyline query; Accuracy; Algorithm design and analysis; Approximation algorithms; Approximation methods; Computers; Monitoring; Query processing; Top-k dominating queries; algorithms; analysis; approximation.; continuous queries; data streams;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2011.43
Filename
5710928
Link To Document