DocumentCode :
1097558
Title :
Distributed Parallel Support Vector Machines in Strongly Connected Networks
Author :
Lu, Yumao ; Roychowdhury, Vwani ; Vandenberghe, Lieven
Author_Institution :
Yahoo! Inc., Sunnyvale, CA
Volume :
19
Issue :
7
fYear :
2008
fDate :
7/1/2008 12:00:00 AM
Firstpage :
1167
Lastpage :
1178
Abstract :
In this paper, we propose a distributed parallel support vector machine (DPSVM) training mechanism in a configurable network environment for distributed data mining. The basic idea is to exchange support vectors among a strongly connected network (SCN) so that multiple servers may work concurrently on distributed data set with limited communication cost and fast training speed. The percentage of servers that can work in parallel and the communication overhead may be adjusted through network configuration. The proposed algorithm further speeds up through online implementation and synchronization. We prove that the global optimal classifier can be achieved iteratively over an SCN. Experiments on a real-world data set show that the computing time scales well with the size of the training data for most networks. Numerical results show that a randomly generated SCN may achieve better performance than the state of the art method, cascade SVM, in terms of total training time.
Keywords :
data mining; iterative methods; network theory (graphs); optimisation; parallel processing; pattern classification; support vector machines; configurable network environment; distributed data mining; distributed parallel support vector machine; global optimal classifier; iterative method; strongly connected network; Convergence; distributed data mining; parallel computing; strongly connected networks (SCNs); support vector machine (SVM);
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/TNN.2007.2000061
Filename :
4470008
Link To Document :
بازگشت