DocumentCode
3286317
Title
A New Fast Frequent Itemsets Mining Algorithm Based on Forest
Author
Hu, Jian ; Yang-Li, Xiang
Author_Institution
Sch. of Manage., Harbin Inst. of Technol., Harbin
Volume
2
fYear
2008
fDate
18-20 Oct. 2008
Firstpage
551
Lastpage
555
Abstract
Frequent itemsets mining is an important data mining task. Until now there are many algorithms to mine frequent itemsets, this paper emphatically analyses existing algorithms´ memory structure of transaction database and realization skill, as well as the usage of memory. On the basis, two new data structures UFP-Tree and FP-Forest are designed, which use multi-trees structure to store data and solve the single FP-tree storage large scale database difficulty problem. At the same time, a fast frequent itemsets mining algorithm, F-Fminer is presented, which adopts divide and conquer strategy to mine frequent itemsets for every UFP-Tree with deepth-first searching method, then performs right shift combination operation for the branches in UFP-Tree. In order to reduce the time spending of allocation and deallocation memory, this paper also designed a high performance memory manager. According to experimentation on real data sets, the algorithm has greatly enhanced frequent itemsets mining efficiency.
Keywords
data mining; tree data structures; F-Fminer; FP-Forest; UFP-Tree; allocation memory; data mining; data structures; deallocation memory; deepth-first searching method; fast frequent itemsets mining algorithm; Algorithm design and analysis; Conference management; Costs; Data mining; Data structures; Itemsets; Knowledge management; Memory management; Technology management; Transaction databases; Data Mining; Frequent Itemsets; UFP-Tree;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location
Shandong
Print_ISBN
978-0-7695-3305-6
Type
conf
DOI
10.1109/FSKD.2008.214
Filename
4666177
Link To Document