Title :
Efficient Maintenance and Mining of Frequent Itemsets over Online Data Streams with a Sliding Window
Author :
Li, Hua-Fu ; Ho, Chin-Chuan ; Shan, Man-Kwan ; Lee, Suh-Yin
Author_Institution :
Nat. Chiao-Tung Univ., Hsinchu
Abstract :
Online mining of streaming data is one of the most important issues in data mining. In this paper, we proposed an efficient one-pass algorithm, called MFI-TransSW (mining frequent itemsets over a transaction-sensitive sliding window), to mine the set of all frequent itemsets in data streams with a transaction-sensitive sliding window. An effective bit-sequence representation of items is used in the proposed algorithm to reduce the time and memory needed to slide the windows. The experiments show that the proposed algorithm not only attain highly accurate mining results, but also run significant faster and consume less memory than existing algorithms for mining frequent itemsets over recent data streams.
Keywords :
data analysis; data mining; data mining; frequent itemset; online data stream; sliding window; Algorithm design and analysis; Computer science; Cybernetics; Data mining; Data structures; Data warehouses; Itemsets; Measurement; Monitoring; Real time systems;
Conference_Titel :
Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
1-4244-0099-6
Electronic_ISBN :
1-4244-0100-3
DOI :
10.1109/ICSMC.2006.385267