DocumentCode
2488908
Title
Kernel Bisecting k-means clustering for SVM training sample reduction
Author
Liu, Xiao-Zhang ; Feng, Guo-can
Author_Institution
Fac. of Math. & Comput., Sun Yat-sen Univ., Guangzhou
fYear
2008
fDate
8-11 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
This paper presents a new algorithm named kernel bisecting k-means and sample removal (KBK-SR) as a sampling preprocessing for SVM training to improve the scalability. The novel clustering approach kernel bisecting k-means in the KBK-SR tends to fast produce balanced clusters of similar sizes in the kernel feature space, which makes KBK-SR efficient and effective for reducing training samples for nonlinear SVMs. Theoretical analysis and experimental results on three UCI real data benchmarks both show that, with very short sampling time, our algorithm dramatically accelerates SVM training while maintaining high test accuracy.
Keywords
pattern clustering; sampling methods; support vector machines; SVM training sample reduction; UCI real data benchmarks; kernel bisecting k-means clustering; kernel feature space; nonlinear SVM; sample removal; sampling preprocessing; Algorithm design and analysis; Benchmark testing; Clustering algorithms; Kernel; Life estimation; Mathematics; Sampling methods; Scalability; Sun; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location
Tampa, FL
ISSN
1051-4651
Print_ISBN
978-1-4244-2174-9
Electronic_ISBN
1051-4651
Type
conf
DOI
10.1109/ICPR.2008.4761793
Filename
4761793
Link To Document