DocumentCode :
2311850
Title :
Comparison of feature ranking methods based on information entropy
Author :
Duch, Wlodzislaw ; Wieczorek, Tadeusz ; Biesiada, Jacek ; Blachnik, Marcin
Author_Institution :
Dept. of Inf., Nicholas Copernicus Univ., Torun, Poland
Volume :
2
fYear :
2004
fDate :
25-29 July 2004
Firstpage :
1415
Abstract :
A comparison between five feature ranking methods based on entropy is presented on artificial and real datasets. Feature ranking method using χ2 statistics gives results that are very similar to the entropy-based methods. The quality of feature rankings obtained by these methods is evaluated using the decision tree and the nearest neighbor classifier with growing number of most important features. Significant differences are found in some cases, but there is no single best index that works best for all data and all classifiers. Therefore to be sure that a subset of features giving highest accuracy has been selected requires the use of many different indices.
Keywords :
decision trees; entropy; feature extraction; pattern classification; set theory; statistics; χ2 statistics; artificial datasets; decision trees; feature ranking methods; information entropy based methods; nearest neighbor classifier; real datasets; subsets; Bioinformatics; Classification tree analysis; Decision trees; Feature extraction; Filters; Informatics; Information entropy; Nearest neighbor searches; Statistics; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN :
1098-7576
Print_ISBN :
0-7803-8359-1
Type :
conf
DOI :
10.1109/IJCNN.2004.1380157
Filename :
1380157
Link To Document :
بازگشت