• DocumentCode
    2023090
  • Title

    A distributed SVM for image annotation

  • Author

    Alham, Nasullah Khalid ; Li, Maozhen ; Hammoud, Suhel ; Liu, Yang ; Ponraj, Mahesh

  • Author_Institution
    Sch. of Eng. & Design, Brunel Univ., Uxbridge, UK
  • Volume
    6
  • fYear
    2010
  • fDate
    10-12 Aug. 2010
  • Firstpage
    2983
  • Lastpage
    2987
  • Abstract
    The popularity of SVMs has grown tremendously in the last few years for many different classification problems due to its generalization properties, however training SVMs require high computational power. Platt´s SMO is one the fastest algorithm for training support vector machines, which takes the decomposition technique to the extreme by selecting a set of only two points as the working set then solving them analytically. However SMO becomes slow for large size training data set. In this paper we present a MapReduce based distributed implementation of SMO using Hadoop. The distributed SMO uses multiple core processors to process the training data. By partitioning the training data set into smaller subsets and allocating each of the partitioned subsets to a single Map task, each Map task optimizes the partition in parallel and finally the reducer combine the results. Experiments show the efficiency of the distributed SMO increases with the increase of the number of processors, the training speed of distributed SMO with 12 Map task is about 11times higher than standalone SMO. There is no significant difference in accuracy between distributed and standalone SMO.
  • Keywords
    distributed processing; generalisation (artificial intelligence); image classification; multiprocessing systems; support vector machines; Hadoop; MapReduce based distributed implementation; Platt SMO; decomposition technique; distributed SMO; distributed SVM; generalization property; image annotation; multiple core processors; sequential minimal optimization; support vector machines; Accuracy; Algorithm design and analysis; Classification algorithms; Program processors; Support vector machines; Training; Training data; SMO; distributed SVM; image anotation; machine learning; mapReduce;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
  • Conference_Location
    Yantai, Shandong
  • Print_ISBN
    978-1-4244-5931-5
  • Type

    conf

  • DOI
    10.1109/FSKD.2010.5569084
  • Filename
    5569084