DocumentCode
2141764
Title
A parallel algorithm for frequent itemset mining
Author
Li, Li ; Zhai, Donghai ; Jin, Fan
Author_Institution
Sch. of Comput. & Commun. Eng., Southwest Jiaotong Univ., Chengdu, China
fYear
2003
fDate
27-29 Aug. 2003
Firstpage
868
Lastpage
871
Abstract
Frequent itemsets mining plays an essential role in data mining. A new algorithm PFP-growth (parallel FP-growth), which is based on the improved FP-growth, is proposed for parallel frequent itemset mining. The new algorithm distributes the task fairly among the parallel processors. We devise partitioning strategies at different stages of the mining process to achieve balance between processors and adopt some data structure to reduce the information transportation between processors. The experiments on national high performance parallel computer show that the PFP-growth is an efficient parallel algorithm for mining frequent itemset.
Keywords
data mining; data structures; parallel algorithms; PFP-growth algorithm; data mining; data structure; frequent itemset mining; information transportation; parallel FP-growth algorithm; parallel algorithm; parallel computer; parallel processors; Broadcasting; Data mining; Itemsets; Merging; Parallel algorithms; Transportation;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Computing, Applications and Technologies, 2003. PDCAT'2003. Proceedings of the Fourth International Conference on
Print_ISBN
0-7803-7840-7
Type
conf
DOI
10.1109/PDCAT.2003.1236435
Filename
1236435
Link To Document