Title :
Efficient single-pass frequent itemsets mining over data streams
Author :
Tan, Jun ; Bu, Yingyong ; Zhao, Haiming
Author_Institution :
Coll. of Comput. & Inf. Eng., Central South Univ. of Forestry & Technol., Changsha, China
Abstract :
Finding frequent itemsets is one of the most important problems in mining data streams, with applications such as web click stream mining, sensor networks, and network traffic analysis. Most prominent algorithms for traditional transaction databases need multiple scans; they are therefore unsuitable for data streams, which are continuous, unbounded, and usually arrive at high speed. In this paper, we propose a new single-pass algorithm that uses the FP-tree data structure in combination with the IT-matrix technique, which greatly reduces the need to traverse FP-trees. Experimental results on synthetic and real datasets show that the proposed algorithm is an efficient method for mining frequent itemsets over data streams.
Keywords :
data mining; tree data structures; FP-tree data structure; IT-matrix technique; Web click stream mining; data stream mining; efficient single-pass frequent itemset mining; network traffic analysis; sensor networks; single-pass algorithms; synthetic datasets; transaction databases; traverse FP-trees; Algorithm design and analysis; Construction industry; Data mining; Data structures; Itemsets; Radiation detectors; Data streams; FP-growth; Frequent itemsets; IT-matrix
Conference_Title :
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-5931-5
DOI :
10.1109/FSKD.2010.5569199