Title :
PCPSVM: A parallel cutting plane algorithm for training SVMs
Author_Institution :
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou, China
Abstract :
The cutting plane algorithm provides fast training for classification SVMs, but it still suffers from the problem of memory restriction, because the algorithm requires to load all the data to the memory. To overcome this bottleneck, we propose and implement a Parallel Cutting Plane algorithm for training Support Vector Machines (PCPSVM) on distributed computers. The Algorithm uses a row-based storage method to reduce memory requirement and finally can parallelize both data loading and computation. Let l denote the number of training instances, d the dimension of each instance, m the number of machines. We divide the data to m parts, and loads only essential data to each machine to perform parallel computation. The memory requirement can be reduced from O(ld) to O(ld/m). We implement our PCPSVM algorithm in the MPICH platform. Experiments show that the algorithm is effective, memory requirement is reduced and great speed-up is achieved when many processors are used. PCPSVM Open Source is available at http://code.google.com/p/pcpsvm/.
Keywords :
learning (artificial intelligence); parallel algorithms; support vector machines; PCPSVM algorithm; SVM classification; SVM training; parallel computation; parallel cutting plane algorithm; row-based storage method; support vector machines; Computer science; Concurrent computing; Data engineering; Distributed computing; Kernel; Large-scale systems; Machine learning; Machine learning algorithms; Support vector machine classification; Support vector machines; Cutting Plane; Large-Scale Problem; Machine Learning; Parallel Computing; Row-based; SVMs;
Conference_Titel :
Computer Engineering and Technology (ICCET), 2010 2nd International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-6347-3
DOI :
10.1109/ICCET.2010.5485293