Title :
A parallel SVM training algorithm on large-scale classification problems
Author :
Zhang, Jian-pei ; Li, Zhong-Wei ; Yang, Jing
Author_Institution :
Coll. of Comput. Sci. & Technol., Harbin Eng. Univ., China
Abstract :
Support vector machine (SVM) has become a popular classification tool but the main disadvantages of SVM algorithms are their large memory requirement and computation time to deal with very large datasets. To speed up the process of training SVM, parallel methods have been proposed by splitting the problem into smaller subsets and training a network to assign samples of different subsets. A parallel training algorithm on large-scale classification problems is proposed, in which multiple SVM classifiers are applied and may be trained in a distributed computer system. As an improvement algorithm of cascade SVM, the support vectors are obtained according to the data samples´ distance mean and the feedback is not the whole final output but alternating to avoid the problem that the learning results are subject to the distribution state of the data samples in different subsets. The experiment results on real-world text dataset show that this parallel SVM training algorithm is efficient and has more satisfying accuracy compared with standard cascade SVM algorithm in classification precision.
Keywords :
data mining; parallel algorithms; pattern classification; support vector machines; alternating feedback; cascade SVM; distributed computer system; large-scale classification problems; multiple SVM classifiers; parallel SVM training algorithm; parallel learning; support vector machine; text dataset; Classification algorithms; Concurrent computing; Distributed computing; Educational institutions; Large-scale systems; Machine learning; Quadratic programming; Support vector machine classification; Support vector machines; Training data; Support Vector Machine; cascade SVM; distance mean; feedback; parallel learning;
Conference_Titel :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
DOI :
10.1109/ICMLC.2005.1527207