DocumentCode :
61691
Title :
Feature-Selected Tree-Based Classification
Author :
Freeman, Chas ; Kulic, Dana ; Basir, Otman
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Waterloo, Waterloo, ON, Canada
Volume :
43
Issue :
6
fYear :
2013
fDate :
Dec. 2013
Firstpage :
1990
Lastpage :
2004
Abstract :
Feature selection can decrease classifier size and improve accuracy by removing noisy and/or redundant features. However, it is possible for feature selection to yield features that are only partially informative about the classes in the set. These features are beneficial for distinguishing between some classes but not others. In these cases, it is beneficial to divide the large classification problem into a set of smaller problems, where a more specific set of features can be used to classify different classes. Dividing a problem this way is also common when the base classifier is binary, and the problem needs to be reformulated as a set of two-class problems so it can be handled by the classifier. This paper presents a method for multiclass classification that simultaneously formulates a binary tree of simpler classification subproblems and performs feature selection for the individual classifiers. The feature selected hierarchical classifier (FSHC) is tested against several well-known techniques for multiclass division. Tests are run on nine different real data sets and one artificial data set using a support vector machine (SVM) classifier. The results show that the accuracy obtained by the FSHC is comparable with other common multiclass SVM methods. Furthermore, the results demonstrate that the algorithm creates solutions with fewer classifiers, fewer features, and a shorter testing time than the other SVM multiclass extensions.
Keywords :
pattern classification; support vector machines; trees (mathematics); FSHC; SVM classifier; SVM multiclass extensions; artificial data set; base classifier; classification subproblems binary tree; classifier size; feature selected hierarchical classifier; feature-selected tree-based classification; multiclass SVM methods; multiclass classification; multiclass division; real data sets; support vector machine classifier; two-class problems; Accuracy; Binary trees; Bioinformatics; Genomics; Support vector machines; Testing; Training; Classification algorithms; genetic algorithms; supervised learning; support vector machines;
fLanguage :
English
Journal_Title :
Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
2168-2267
Type :
jour
DOI :
10.1109/TSMCB.2012.2237394
Filename :
6464621
Link To Document :
بازگشت