DocumentCode :
2738767
Title :
An Efficient Frequent Pattern Mining Algorithm
Author :
Tan, Jun ; Bu, Yingyong ; Yang, Bo
Author_Institution :
Coll. of Comput. Sci., Central South Univ. of Forestry & Technol., Changsha, China
Volume :
2
fYear :
2009
fDate :
14-16 Aug. 2009
Firstpage :
47
Lastpage :
50
Abstract :
Efficient algorithms for mining frequent itemsets are crucial for mining association rules and for other data mining tasks. FP-growth algorithm has been implemented using a prefix-tree structure, known as a FP-tree, for storing compressed frequency information. Numerous experimental results have demonstrated that the algorithm performs extremely well. But In FP-growth algorithm, two traversals of FP-tree are needed for constructing the new conditional FP-tree. In this paper we present a novel FP-array technique that greatly reduces the need to traverse FP-trees, thus obtaining significantly improved performance for FP-tree based algorithms. The technique works especially well for sparse datasets. We then present a new algorithm which use the FP-tree data structure in combination with the FP-array technique efficiently and get the counts of frequent items from FP-array directly in order to omit the first scanning and save time. Experimental results show that the new algorithm outperform other algorithm in not only the speed of algorithms, but also their memory consumption and their scalability.
Keywords :
data mining; trees (mathematics); data mining tasks; frequent pattern-array technique; frequent pattern-growth algorithm; frequent pattern-tree; mining association rules; mining frequent itemsets; pattern mining algorithm; Association rules; Computer science; Data mining; Data structures; Databases; Educational institutions; Frequency shift keying; Fuzzy systems; Itemsets; Scalability; FP-array; FP-growth algorithm; FP-tree; sparse datasets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
Type :
conf
DOI :
10.1109/FSKD.2009.492
Filename :
5358482
Link To Document :
بازگشت