DocumentCode
61691
Title
Feature-Selected Tree-Based Classification
Author
Freeman, Chas ; Kulic, Dana ; Basir, Otman
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of Waterloo, Waterloo, ON, Canada
Volume
43
Issue
6
fYear
2013
fDate
Dec. 2013
Firstpage
1990
Lastpage
2004
Abstract
Feature selection can decrease classifier size and improve accuracy by removing noisy and/or redundant features. However, it is possible for feature selection to yield features that are only partially informative about the classes in the set. These features are beneficial for distinguishing between some classes but not others. In these cases, it is beneficial to divide the large classification problem into a set of smaller problems, where a more specific set of features can be used to classify different classes. Dividing a problem this way is also common when the base classifier is binary, and the problem needs to be reformulated as a set of two-class problems so it can be handled by the classifier. This paper presents a method for multiclass classification that simultaneously formulates a binary tree of simpler classification subproblems and performs feature selection for the individual classifiers. The feature selected hierarchical classifier (FSHC) is tested against several well-known techniques for multiclass division. Tests are run on nine different real data sets and one artificial data set using a support vector machine (SVM) classifier. The results show that the accuracy obtained by the FSHC is comparable with other common multiclass SVM methods. Furthermore, the results demonstrate that the algorithm creates solutions with fewer classifiers, fewer features, and a shorter testing time than the other SVM multiclass extensions.
Keywords
pattern classification; support vector machines; trees (mathematics); FSHC; SVM classifier; SVM multiclass extensions; artificial data set; base classifier; classification subproblems binary tree; classifier size; feature selected hierarchical classifier; feature-selected tree-based classification; multiclass SVM methods; multiclass classification; multiclass division; real data sets; support vector machine classifier; two-class problems; Accuracy; Binary trees; Bioinformatics; Genomics; Support vector machines; Testing; Training; Classification algorithms; genetic algorithms; supervised learning; support vector machines;
fLanguage
English
Journal_Title
Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
2168-2267
Type
jour
DOI
10.1109/TSMCB.2012.2237394
Filename
6464621
Link To Document