DocumentCode :
3543598
Title :
gpuDCI: Exploiting GPUs in Frequent Itemset Mining
Author :
Silvestri, Claudio ; Orlando, Salvatore
Author_Institution :
Univ. Ca´´ Foscari Venezia, Italy
fYear :
2012
fDate :
15-17 Feb. 2012
Firstpage :
416
Lastpage :
425
Abstract :
Frequent item set mining (FIM) algorithms extract subsets of items that occurs frequently in a collection of sets. FIM is a key analysis in several data mining applications, and the FIM tools are among the most computationally intensive data mining ones. In this work we present a many-core parallel version of a state-of-the-art FIM algorithm, DCI, whose sequential version resulted, for most of the tested datasets, better than FP-Growth, one of the most efficient algorithms for FIM. We propose a couple of parallelization strategies for Graphics Processing Units (GPU) suitable for different resource availability, and we present the results of several experiments conducted on real-world and synthetic datasets.
Keywords :
data mining; graphics processing units; multiprocessing systems; parallel processing; FIM algorithm; FIM tool; GPU; data mining; frequent itemset mining; gpuDCI; graphics processing unit; many-core parallel version; parallelization strategies; Algorithm design and analysis; Data mining; Data structures; Graphics processing unit; Instruction sets; Itemsets; Synchronization; GP-GPU; cuda; frequent itemset mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel, Distributed and Network-Based Processing (PDP), 2012 20th Euromicro International Conference on
Conference_Location :
Garching
ISSN :
1066-6192
Print_ISBN :
978-1-4673-0226-5
Type :
conf
DOI :
10.1109/PDP.2012.94
Filename :
6169580
Link To Document :
بازگشت