• DocumentCode
    3599833
  • Title

    A real-time top-k query algorithm and parallelized implementation

  • Author

    Yao Lu ; Jun Liu

  • Author_Institution
    Beijing Key Lab. of Network Syst. Archit. & Convergence, Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2014
  • Firstpage
    160
  • Lastpage
    164
  • Abstract
    The analysis of data streams is of great value in many fields such as network monitoring and sensor instrumentation. As a common operation, top-k query over data stream is the basis and core of other problems in data stream analysis. In this paper, we introduce a parallel algorithm based on Frequent algorithm and implement it by utilizing Apache Storm. Further, we evaluate the algorithm by estimated error under various situations and show that the algorithm can effectively improve the precision of top-k query by adjusting the parallel degree. The parallelized implementation is of significance in network traffic monitoring.
  • Keywords
    data analysis; parallel algorithms; query processing; Apache Storm; data stream analysis; network monitoring; network traffic monitoring; parallel algorithm; parallelized implementation; real-time top-k query algorithm; sensor instrumentation; Australia; Fasteners; Hardware; Instruments; Process control; Radiation detectors; Storms; Apache Storm; Parallel algorithm; Top-k query;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing and Intelligence Systems (CCIS), 2014 IEEE 3rd International Conference on
  • Print_ISBN
    978-1-4799-4720-1
  • Type

    conf

  • DOI
    10.1109/CCIS.2014.7175722
  • Filename
    7175722