Title :
Parallel execution of distributed SVM using MPI (CoDLib)
Author :
Salleh, Nur Shakirah Md ; Suliman, Azizah ; Ahmad, Abdul Rahim
Author_Institution :
Dept. of Syst. & Networking, Univ. Tenaga Nasional, Kajang, Malaysia
Abstract :
Support Vector Machine (SVM) is an efficient data mining approach for data classification. However, SVM algorithm requires very large memory requirement and computational time to deal with very large dataset. To reduce the computational time during the process of training the SVM, a combination of distributed and parallel computing method, CoDLib have been proposed. Instead of using a single machine for parallel computing, multiple machines in a cluster are used. Message Passing Interface (MPI) is used in the communication between machines in the cluster. The original dataset is split and distributed to the respective machines. Experiments results shows a great speed up on the training of the MNIST dataset where training time has been significantly reduced compared with standard LIBSVM without affecting the quality of the SVM.
Keywords :
data mining; message passing; parallel processing; pattern classification; support vector machines; CoDLib; LIBSVM; MNIST dataset; data classification; data mining approach; distributed S VM; distributed computing method; message passing interface; parallel computing method; parallel execution; support vector machine; Computers; Kernel; Machine learning; Message passing; Support vector machines; Testing; Training; Distributed SVM; LIBSVM; Message Passing Interface (MPI); Support Vector Machine (SVM);
Conference_Titel :
Information Technology and Multimedia (ICIM), 2011 International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4577-0988-3
DOI :
10.1109/ICIMU.2011.6122723