DocumentCode :
3223981
Title :
Tree Projection-Based Frequent Itemset Mining on Multicore CPUs and GPUs
Author :
Teodoro, G. ; Mariano, N. ; Meira, W. ; Ferreira, R.
Author_Institution :
Dept. of Comput. Sci., Univ. Fed. de Minas Gerais, Belo Horizonte, Brazil
fYear :
2010
fDate :
27-30 Oct. 2010
Firstpage :
47
Lastpage :
54
Abstract :
Frequent itemset mining (FIM) is a core operation for several data mining applications as association rules computation, correlations, document classification, and many others, which has been extensively studied over the last decades. Moreover, databases are becoming increasingly larger, thus requiring a higher computing power to mine them in reasonable time. At the same time, the advances in high performance computing platforms are transforming them into hierarchical parallel environments equipped with multi-core processors and many-core accelerators, such as GPUs. Thus, fully exploiting these systems to perform FIM tasks poses as a challenging and critical problem that we address in this paper. We present efficient multi-core and GPU accelerated parallelizations of the Tree Projection, one of the most competitive FIM algorithms. The experimental results show that our Tree Projection implementation scales almost linearly in a CPU shared-memory environment after careful optimizations, while the GPU versions are up to 173 times faster than standard the CPU version.
Keywords :
computer graphic equipment; coprocessors; data mining; multiprocessing systems; association rules; frequent itemset mining; graphics processing units; high performance computing; many-core accelerators; multicore CPUs; multicore GPUs; multicore processors; tree projection algorithm; Data mining; Graphics processing unit; Instruction sets; Itemsets; Multicore processing; Parallel processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Architecture and High Performance Computing (SBAC-PAD), 2010 22nd International Symposium on
Conference_Location :
Petropolis
ISSN :
1550-6533
Print_ISBN :
978-1-4244-8287-0
Type :
conf
DOI :
10.1109/SBAC-PAD.2010.15
Filename :
5644924
Link To Document :
بازگشت