Title :
A real-time top-k query algorithm and parallelized implementation
Author :
Yao Lu ; Jun Liu
Author_Institution :
Beijing Key Lab. of Network Syst. Archit. & Convergence, Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
The analysis of data streams is of great value in many fields such as network monitoring and sensor instrumentation. As a common operation, top-k query over data stream is the basis and core of other problems in data stream analysis. In this paper, we introduce a parallel algorithm based on Frequent algorithm and implement it by utilizing Apache Storm. Further, we evaluate the algorithm by estimated error under various situations and show that the algorithm can effectively improve the precision of top-k query by adjusting the parallel degree. The parallelized implementation is of significance in network traffic monitoring.
Keywords :
data analysis; parallel algorithms; query processing; Apache Storm; data stream analysis; network monitoring; network traffic monitoring; parallel algorithm; parallelized implementation; real-time top-k query algorithm; sensor instrumentation; Australia; Fasteners; Hardware; Instruments; Process control; Radiation detectors; Storms; Apache Storm; Parallel algorithm; Top-k query;
Conference_Titel :
Cloud Computing and Intelligence Systems (CCIS), 2014 IEEE 3rd International Conference on
Print_ISBN :
978-1-4799-4720-1
DOI :
10.1109/CCIS.2014.7175722