DocumentCode
2023090
Title
A distributed SVM for image annotation
Author
Alham, Nasullah Khalid ; Li, Maozhen ; Hammoud, Suhel ; Liu, Yang ; Ponraj, Mahesh
Author_Institution
Sch. of Eng. & Design, Brunel Univ., Uxbridge, UK
Volume
6
fYear
2010
fDate
10-12 Aug. 2010
Firstpage
2983
Lastpage
2987
Abstract
The popularity of SVMs has grown tremendously in the last few years for many different classification problems due to its generalization properties, however training SVMs require high computational power. Platt´s SMO is one the fastest algorithm for training support vector machines, which takes the decomposition technique to the extreme by selecting a set of only two points as the working set then solving them analytically. However SMO becomes slow for large size training data set. In this paper we present a MapReduce based distributed implementation of SMO using Hadoop. The distributed SMO uses multiple core processors to process the training data. By partitioning the training data set into smaller subsets and allocating each of the partitioned subsets to a single Map task, each Map task optimizes the partition in parallel and finally the reducer combine the results. Experiments show the efficiency of the distributed SMO increases with the increase of the number of processors, the training speed of distributed SMO with 12 Map task is about 11times higher than standalone SMO. There is no significant difference in accuracy between distributed and standalone SMO.
Keywords
distributed processing; generalisation (artificial intelligence); image classification; multiprocessing systems; support vector machines; Hadoop; MapReduce based distributed implementation; Platt SMO; decomposition technique; distributed SMO; distributed SVM; generalization property; image annotation; multiple core processors; sequential minimal optimization; support vector machines; Accuracy; Algorithm design and analysis; Classification algorithms; Program processors; Support vector machines; Training; Training data; SMO; distributed SVM; image anotation; machine learning; mapReduce;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location
Yantai, Shandong
Print_ISBN
978-1-4244-5931-5
Type
conf
DOI
10.1109/FSKD.2010.5569084
Filename
5569084
Link To Document