Title :
A distributed SVM for image annotation
Author :
Alham, Nasullah Khalid ; Li, Maozhen ; Hammoud, Suhel ; Liu, Yang ; Ponraj, Mahesh
Author_Institution :
Sch. of Eng. & Design, Brunel Univ., Uxbridge, UK
Abstract :
The popularity of SVMs has grown tremendously in the last few years for many different classification problems due to its generalization properties, however training SVMs require high computational power. Platt´s SMO is one the fastest algorithm for training support vector machines, which takes the decomposition technique to the extreme by selecting a set of only two points as the working set then solving them analytically. However SMO becomes slow for large size training data set. In this paper we present a MapReduce based distributed implementation of SMO using Hadoop. The distributed SMO uses multiple core processors to process the training data. By partitioning the training data set into smaller subsets and allocating each of the partitioned subsets to a single Map task, each Map task optimizes the partition in parallel and finally the reducer combine the results. Experiments show the efficiency of the distributed SMO increases with the increase of the number of processors, the training speed of distributed SMO with 12 Map task is about 11times higher than standalone SMO. There is no significant difference in accuracy between distributed and standalone SMO.
Keywords :
distributed processing; generalisation (artificial intelligence); image classification; multiprocessing systems; support vector machines; Hadoop; MapReduce based distributed implementation; Platt SMO; decomposition technique; distributed SMO; distributed SVM; generalization property; image annotation; multiple core processors; sequential minimal optimization; support vector machines; Accuracy; Algorithm design and analysis; Classification algorithms; Program processors; Support vector machines; Training; Training data; SMO; distributed SVM; image anotation; machine learning; mapReduce;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-5931-5
DOI :
10.1109/FSKD.2010.5569084