DocumentCode :
1447045
Title :
Continuous Top-k Dominating Queries
Author :
Kontaki, Maria ; Papadopoulos, Apostolos N. ; Manolopoulos, Yannis
Author_Institution :
Dept. of Inf., Aristotle Univ., Thessaloniki, Greece
Volume :
24
Issue :
5
fYear :
2012
fDate :
5/1/2012 12:00:00 AM
Firstpage :
840
Lastpage :
853
Abstract :
Top-k dominating queries use an intuitive scoring function which ranks multidimensional points with respect to their dominance power, i.e., the number of points that a point dominates. The k points with the best (e.g., highest) scores are returned to the user. Both top-k and skyline queries have been studied in a streaming environment, where changes to the data set are very frequent. In such an environment, continuous query processing techniques are required toward efficient monitoring of query results, since periodic query re-execution is computationally intensive, and therefore, prohibitive. This work contains the first study of continuous top-k dominating queries over data streams. In comparison to continuous top-k and skyline queries, continuous top-k dominating queries pose additional challenges. Three exact algorithms (BFA, EVA, ADA) are studied, and among them ADA, which is enhanced with additional optimization techniques, shows the best overall performance. In some cases, we are willing to trade accuracy for speed. Toward this direction, two approximate algorithms are proposed (AHBA and AMSA). AHBA offers probabilistic guarantees regarding the accuracy of the result based on the Hoeffding bound, whereas AMSA performs a more aggressive computation resulting in more efficient processing. Evaluation results, based on real-life and synthetic data sets, show the efficiency and scalability of our techniques.
Keywords :
approximation theory; optimisation; probability; query processing; ADA algorithm; AHBA approximate algorithm; AMSA approximate algorithm; BFA algorithm; EVA algorithm; Hoeffding bound; continuous query processing technique; continuous top-k dominating query; data stream; intuitive scoring function; multidimensional point ranking; optimization technique; periodic query reexecution; probabilistic guarantee; query result monitoring; skyline query; Accuracy; Algorithm design and analysis; Approximation algorithms; Approximation methods; Computers; Monitoring; Query processing; Top-k dominating queries; algorithms; analysis; approximation.; continuous queries; data streams;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2011.43
Filename :
5710928
Link To Document :
بازگشت