Title :
A geometric approach to train SVM on very large data sets
Author :
Zeng, Zhi-Qiang ; Xu, Hua-Rong ; Xie, Yan-Qi ; Gao, Ji
Author_Institution :
Dept. of Comput. Sci. & Technol., Xiamen Univ. of Technol., Xiamen, China
Abstract :
Reduced set method is an important approach to speed up support vector machine (SVM) training on large data sets. Existing works mainly focused on selecting patterns near the decision boundary for SVM training by applying clustering, nearest neighbor algorithm and so on. However, on very large data sets, these algorithms require huge computational overhead, and thus the total running time is still enormous. In this paper, an intuitive geometric method is developed to select convex hull samples in the feature space for SVM training, which has a time complexity that is linear with training set size n. Experiments on real data sets show that the proposed method not only preserves the generalization performance of the result SVM classifiers, but outperforms existing scale-up methods in terms of training time and number of support vectors.
Keywords :
computational complexity; geometry; support vector machines; SVM; computational overhead; convex hull samples; geometric approach; intuitive geometric method; nearest neighbor algorithm; reduced set method; support vector machine training; time complexity; very large data sets; Clustering algorithms; Computer science; Data engineering; Intelligent systems; Knowledge engineering; Nearest neighbor searches; Physics; Quadratic programming; Support vector machine classification; Support vector machines;
Conference_Titel :
Intelligent System and Knowledge Engineering, 2008. ISKE 2008. 3rd International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4244-2196-1
Electronic_ISBN :
978-1-4244-2197-8
DOI :
10.1109/ISKE.2008.4731074