Title :
Data Selection Using Decision Tree for SVM Classification
Author :
Lopez-Chau, A. ; Garcia, L.L. ; Cervantes, J. ; Xiaoou Li ; Wen Yu
Author_Institution :
Univ. Autonoa del Estado de Mexico, Mexico City, Mexico
Abstract :
Support Vector Machine (SVM) is an important classification method used in a many areas. The training of SVM is almost O(n^{2}) in time and space. Some methods to reduce the training complexity have been proposed in last years. Data selection methods for SVM select most important examples from training data sets to improve its training time. This paper introduces a novel data reduction method that works detecting clusters and then selects some examples from them. Different from other state of the art algorithms, the novel method uses a decision tree to form partitions that are treated as clusters, and then executes a guided random selection of examples. The clusters discovered by a decision tree can be linearly separable, taking advantage of the Eidelheit separation theorem, it is possible to reduce the size of training sets by carefully selecting examples from training sets. The novel method was compared with LibSVM using public available data sets, experiments demonstrate an important reduction of the size of training sets whereas showing only a slight decreasing in the accuracy of classifier.
Keywords :
computational complexity; data reduction; decision trees; pattern clustering; support vector machines; Eidelheit separation theorem; LibSVM; SVM classification method; SVM training; cluster detection; data reduction method; data selection methods; decision tree; guided random selection; space complexity; support vector machine; time complexity; training complexity; training data sets; Accuracy; Clustering algorithms; Decision trees; Entropy; Impurities; Support vector machines; Training; Data reduction; Decision tree; Eidelheit separation theorem; SVM;
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2012 IEEE 24th International Conference on
Conference_Location :
Athens
Print_ISBN :
978-1-4799-0227-9
DOI :
10.1109/ICTAI.2012.105