• DocumentCode
    2725124
  • Title
    Adaptive Frequency Counting over Bursty Data Streams
  • Author
    Lin, Bill; Ho, Wai-Shing; Kao, Ben; Chui, Chun-Kit
  • Author_Institution
    Dept. of Comput. Sci., Hong Kong Univ.
  • fYear
    2007
  • fDate
    March 1 2007-April 5 2007
  • Firstpage
    516
  • Lastpage
    523
  • Abstract
    We investigate the problem of frequent itemset mining over a data stream with bursty traffic. In many modern applications, data arrives at a system as a continuous stream of transactions, and the arrival rate of those transactions often fluctuates wildly. Traditional stream mining algorithms, such as Lossy Counting (LC), were generally designed to handle data streams with steady arrival rates. We show that LC suffers a significant loss of accuracy when the data stream is bursty. We propose the Adaptive Frequency Counting (AFC) algorithm to handle bursty data. AFC has a feedback mechanism that dynamically adjusts the mining speed to cope with the changing data arrival rate. Through extensive experiments, we show that AFC outperforms LC under bursty traffic in terms of the accuracy of the set of frequent itemsets.
  • Keywords
    data mining; adaptive frequency counting algorithm; bursty data streams; bursty traffic; data stream handling; frequent itemset mining; Automatic frequency control; Computational intelligence; Computer science; Data analysis; Data mining; Electronic mail; Itemsets; Monitoring; Performance analysis; Telecommunication traffic
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0705-2
  • Type
    conf
  • DOI
    10.1109/CIDM.2007.368918
  • Filename
    4221342