Title :
Sample Reduction for SVMs via Data Structure Analysis
Author :
Wang, Defeng ; Yeung, Daniel S. ; Eric, C.C.T.
Author_Institution :
Dept. of Comput., Hong Kong Polytech. Univ., Hong Kong
Abstract :
This paper presents a new sample reduction algorithm, sample reduction by data structure analysis (SR-DSA), for SVMs to improve their scalability. SR-DSA utilizes data structure information in determining which data points are not useful in learning the separating plane and could be removed. As this algorithm is performed before SVMs training, it avoids the problem suffered by most sample reduction methods whose choices of samples heavily depend on repeatedly training of SVMs. Experiments on both synthetic and real world datasets have shown that SR-DSA is capable of reducing the number of samples as well as the time for SVMs training while maintaining high testing accuracy.
Keywords :
data structures; support vector machines; SVM; data structure; sample reduction; Data analysis; Data structures; Kernel; Probability distribution; Quadratic programming; Sampling methods; Scalability; Statistical learning; Support vector machines; Testing; Hierarchical Clustering; Mahalanobis distance; Sample Reduction; Support Vector Machines (SVMs);
Conference_Titel :
System of Systems Engineering, 2007. SoSE '07. IEEE International Conference on
Conference_Location :
San Antonio, TX
Print_ISBN :
1-4244-1159-9
Electronic_ISBN :
1-4244-1160-2
DOI :
10.1109/SYSOSE.2007.4304333