Title :
A Distributed Data Mining Framework Accelerated with Graphics Processing Units
Author :
Tran, Nam-Luc ; Dugauthier, Quentin ; Skhiri, Sabri
Author_Institution :
Euranova R&D, Belgium
Abstract :
In the context of processing high volumes of data, recent developments have led to numerous models and frameworks for distributed processing on clusters of commodity hardware. At the same time, the Graphics Processing Unit (GPU) has seen enthusiastic development as a device for general-purpose, intensive parallel computation. In this paper we propose a framework that combines both approaches, and we evaluate the relevance of equipping nodes in a distributed processing cluster with GPUs for further fine-grained parallel processing. We have engineered parallel and distributed versions of two data mining algorithms, the naive Bayes classifier and the k-means clustering algorithm, to run on the framework, and have evaluated the resulting performance gain. Finally, we discuss the requirements and perspectives of integrating GPUs into a distributed processing cluster, introducing a fully distributed heterogeneous computing cluster.
Keywords :
Bayes methods; data mining; graphics processing units; parallel processing; pattern classification; pattern clustering; GPU units; commodity hardware clusters; distributed data mining framework; distributed processing cluster; fine-grained parallel processing; fully distributed heterogeneous computing cluster; general-purpose intensive parallel computation; high data volume processing; k-means clustering algorithm; naive Bayes classifier; Clustering algorithms; Computational modeling; Data models; Distributed databases; GPU; algorithm; distributed data mining; distributed processing; kmeans; naive bayes; processing
Conference_Title :
Cloud Computing and Big Data (CloudCom-Asia), 2013 International Conference on
Conference_Location :
Fuzhou
Print_ISBN :
978-1-4799-2829-3
DOI :
10.1109/CLOUDCOM-ASIA.2013.17