• DocumentCode
    2488908
  • Title

    Kernel Bisecting k-means clustering for SVM training sample reduction

  • Author

    Liu, Xiao-Zhang ; Feng, Guo-can

  • Author_Institution
    Fac. of Math. & Comput., Sun Yat-sen Univ., Guangzhou
  • fYear
    2008
  • fDate
    8-11 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper presents a new algorithm named kernel bisecting k-means and sample removal (KBK-SR) as a sampling preprocessing for SVM training to improve the scalability. The novel clustering approach kernel bisecting k-means in the KBK-SR tends to fast produce balanced clusters of similar sizes in the kernel feature space, which makes KBK-SR efficient and effective for reducing training samples for nonlinear SVMs. Theoretical analysis and experimental results on three UCI real data benchmarks both show that, with very short sampling time, our algorithm dramatically accelerates SVM training while maintaining high test accuracy.
  • Keywords
    pattern clustering; sampling methods; support vector machines; SVM training sample reduction; UCI real data benchmarks; kernel bisecting k-means clustering; kernel feature space; nonlinear SVM; sample removal; sampling preprocessing; Algorithm design and analysis; Benchmark testing; Clustering algorithms; Kernel; Life estimation; Mathematics; Sampling methods; Scalability; Sun; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
  • Conference_Location
    Tampa, FL
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-2174-9
  • Electronic_ISBN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2008.4761793
  • Filename
    4761793