• DocumentCode
    2085317
  • Title

    A geometric approach to train SVM on very large data sets

  • Author

    Zeng, Zhi-Qiang ; Xu, Hua-Rong ; Xie, Yan-Qi ; Gao, Ji

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Xiamen Univ. of Technol., Xiamen, China
  • Volume
    1
  • fYear
    2008
  • fDate
    17-19 Nov. 2008
  • Firstpage
    991
  • Lastpage
    996
  • Abstract
    Reduced set method is an important approach to speed up support vector machine (SVM) training on large data sets. Existing works mainly focused on selecting patterns near the decision boundary for SVM training by applying clustering, nearest neighbor algorithm and so on. However, on very large data sets, these algorithms require huge computational overhead, and thus the total running time is still enormous. In this paper, an intuitive geometric method is developed to select convex hull samples in the feature space for SVM training, which has a time complexity that is linear with training set size n. Experiments on real data sets show that the proposed method not only preserves the generalization performance of the result SVM classifiers, but outperforms existing scale-up methods in terms of training time and number of support vectors.
  • Keywords
    computational complexity; geometry; support vector machines; SVM; computational overhead; convex hull samples; geometric approach; intuitive geometric method; nearest neighbor algorithm; reduced set method; support vector machine training; time complexity; very large data sets; Clustering algorithms; Computer science; Data engineering; Intelligent systems; Knowledge engineering; Nearest neighbor searches; Physics; Quadratic programming; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent System and Knowledge Engineering, 2008. ISKE 2008. 3rd International Conference on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-1-4244-2196-1
  • Electronic_ISBN
    978-1-4244-2197-8
  • Type

    conf

  • DOI
    10.1109/ISKE.2008.4731074
  • Filename
    4731074