DocumentCode
2725124
Title
Adaptive Frequency Counting over Bursty Data Streams
Author
Lin, Bill ; Ho, Wai-Shing ; Kao, Ben ; Chui, Chun-Kit
Author_Institution
Dept. of Comput. Sci., Hong Kong Univ.
fYear
2007
fDate
March 1 2007-April 5 2007
Firstpage
516
Lastpage
523
Abstract
We investigate the problem of frequent itemset mining over a data stream with bursty traffic. In many modern applications, data arrives at a system as a continuous stream of transactions. In many cases, the arrival rate of transactions fluctuates wildly. Traditional stream mining algorithms, such as Lossy Counting (LC), were generally designed to handle data streams with steady data arrival rates. We show that LC suffers significant loss of accuracy when the data stream is bursty. We propose the Adaptive Frequency Counting algorithm (AFC) to handle bursty data. AFC has a feedback mechanism that dynamically adjusts the mining speed to cope with the changing data arrival rate. Through extensive experiments, we show that AFC outperforms LC under bursty traffic in terms of the accuracy of the set of frequent itemsets.
Keywords
data mining; adaptive frequency counting algorithm; bursty data streams; bursty traffic; data stream handling; frequent itemset mining; Automatic frequency control; Computational intelligence; Computer science; Data analysis; Data mining; Electronic mail; Itemsets; Monitoring; Performance analysis; Telecommunication traffic
fLanguage
English
Publisher
IEEE
Conference_Titel
Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
Conference_Location
Honolulu, HI
Print_ISBN
1-4244-0705-2
Type
conf
DOI
10.1109/CIDM.2007.368918
Filename
4221342