DocumentCode :
2025990
Title :
Efficient single-pass frequent itemsets mining over data streams
Author :
Tan, Jun ; Bu, Yingyong ; Zhao, Haiming
Author_Institution :
Coll. of Comput. & Inf. Eng., Central South Univ. of Forestry & Technol., Changsha, China
Volume :
3
fYear :
2010
fDate :
10-12 Aug. 2010
Firstpage :
1438
Lastpage :
1441
Abstract :
Finding frequent itemsets is one of the most important issues in mining data streams for many applications such as web click stream mining, sensor networks, and network traffic analysis. Most prominent algorithms for traditional transaction databases need multiple scans, therefore, they are not suitable for data streams which are continuous, unbounded, usually come with high speed. In this paper, we propose a new single -pass algorithms which use the FP-tree data structure in combination with the IT-matrix technique which greatly reduces the need to traverse FP-trees. The experiment results on synthetic datasets and real datasets show that our proposed algorithm is an efficient method for mining frequent itemsets over data streams.
Keywords :
data mining; tree data structures; FP-tree data structure; IT-matrix technique; Web click stream mining; data stream mining; efficient single-pass frequent itemset mining; network traffic analysis; sensor networks; single -pass algorithms; synthetic datasets; transaction databases; traverse FP-trees; Algorithm design and analysis; Construction industry; Data mining; Data structures; Itemsets; Radiation detectors; Data streams; FP-growth; Frequent itemsets; IT-matrix;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-5931-5
Type :
conf
DOI :
10.1109/FSKD.2010.5569199
Filename :
5569199
Link To Document :
بازگشت