• DocumentCode
    2730482
  • Title

    Frequent items mining on data stream using hash-table and heap

  • Author

    Shan, Zhang ; Ling, Chen ; Li, Tu

  • Author_Institution
    Dept. of Comput. Sci., Yang Zhou Univ., Yangzhou, China
  • Volume
    1
  • fYear
    2009
  • fDate
    20-22 Nov. 2009
  • Firstpage
    141
  • Lastpage
    145
  • Abstract
    Most of the existing algorithms for mining frequent items on data stream do not emphasis the importance of the recent data items. We present an algorithm to detect the items with frequency counts exceeding a user-specified threshold. Our algorithm uses a hash table L and a heap to record the potential frequent items, and can detect ¿-approximate frequent data items on data stream using O(|L|+ ¿-1) memory space and the processing time for each data item is O(log¿-1). Experimental results on several artificial and real datasets show our algorithm has higher precision, requires less memory and consumes less computation time than other similar methods.
  • Keywords
    computational complexity; data mining; data stream; frequent items mining; hash-table; ¿-approximate frequent data items; Area measurement; Computer errors; Computer science; Data mining; Extraterrestrial measurements; Fading; Frequency; Information science; Sampling methods; Space technology; data mining; data stream; frequent items; hash table; heap; time fading model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Computing and Intelligent Systems, 2009. ICIS 2009. IEEE International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-4754-1
  • Electronic_ISBN
    978-1-4244-4738-1
  • Type

    conf

  • DOI
    10.1109/ICICISYS.2009.5357918
  • Filename
    5357918