DocumentCode :
243747
Title :
Parallel Frequent Pattern Mining without Candidate Generation on GPUs
Author :
Fei Wang ; Bo Yuan
Author_Institution :
Div. of Inf., Tsinghua Univ., Shenzhen, China
fYear :
2014
fDate :
14-14 Dec. 2014
Firstpage :
1046
Lastpage :
1052
Abstract :
The graphics processing unit (GPU) has evolved into a key part of today´s heterogeneous parallel computing architecture. A number of influential data mining algorithms have been parallelized on GPUs including frequent pattern mining algorithms, such as Apriori. Unfortunately, due to two major challenges, the more effective method for mining frequent patterns without candidate generation named FP-Growth has not been implemented on GPUs. Firstly, it is very hard to efficiently build the FP-Tree in parallel on GPUs as it is an inherently sequential process. Secondly, mining the FP-Tree in parallel is also a difficult task. In this paper, we propose a fully parallel method to build the FP-Tree on CUDA-enabled GPUs and implement a novel parallel algorithm for mining all frequent patterns using the latest CUDA Dynamic Parallelism techniques. We show that, on a range of representative benchmark datasets, the proposed GPU-based FP-Growth algorithm can achieve significant speedups compared to the original algorithm.
Keywords :
data mining; graphics processing units; parallel algorithms; parallel architectures; trees (mathematics); CUDA dynamic parallelism technique; FP-tree; GPU; data mining algorithm; graphics processing unit; parallel algorithm; parallel frequent pattern mining; Algorithm design and analysis; Data mining; Graphics processing units; Heuristic algorithms; Instruction sets; Itemsets; Parallel processing; Dynamic Parallelism; FP-Growth; FP-Tree; GPU;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
Type :
conf
DOI :
10.1109/ICDMW.2014.71
Filename :
7022712
Link To Document :
بازگشت