• DocumentCode
    3453171
  • Title

    A More Accurate Space Saving Algorithm for Finding the Frequent Items

  • Author

    Zhou, Jun ; Chen, Ming ; Xiong, Huan

  • Author_Institution
    Dept. of Comput. Sci., PLAUST, Nanjing, China
  • fYear
    2010
  • fDate
    27-28 Nov. 2010
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    The frequent items problem is to process a stream as a stream of items and find all items occurring more than a given fraction of the time. It is one of the most heavily studied problems in data stream mining, dating back to the 1980s. Aiming at higher false positive rate of the Space-Saving algorithm, an LRU-based (Least Recently Used, LRU) improved algorithm with low frequency item pre-eliminated is proposed. Accuracy, stability and adaptability of the improved algorithm have been apparently enhanced. Experimental results indicate that the algorithm can not only be used to find the frequent items, and can be used to estimate the frequency of them precisely. The improved algorithm can be used for online processing both high-speed network packet stream and backbone NetFlow stream.
  • Keywords
    computer networks; data mining; LRU-based improved algorithm; NetFlow stream; data stream mining; frequent items; high-speed network packet stream; space-saving algorithm; stability; Accuracy; Algorithm design and analysis; Classification algorithms; Complexity theory; Heuristic algorithms; Monitoring; Radiation detectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Technology and Applications (DBTA), 2010 2nd International Workshop on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-6975-8
  • Electronic_ISBN
    978-1-4244-6977-2
  • Type

    conf

  • DOI
    10.1109/DBTA.2010.5659027
  • Filename
    5659027