DocumentCode :
3140736
Title :
Comparison of two families of entropy-based classification measures with and without feature selection
Author :
Giles, Kendall ; Bryson, Kweku-Muata ; Weng, Qin
Author_Institution :
Dept. of Inf. Syst., Virginia Commonwealth Univ., Richmond, VA, USA
fYear :
2001
fDate :
6-6 Jan. 2001
Abstract :
Many decision tree (DT) induction algorithms, including the popular C4.5, are based on the conditional entropy (CE) measure. An interesting question involves the relative performance of other entropy measures, such as class-attribute mutual information (CAMI). We therefore conducted a theoretical analysis of CAMI that enabled us to expose relationships with CE and correct a previous CAMI result. Our computational study showed that there was only a small variation in the performance of the two measures. Since feature selection is important in DT induction, we conducted a theoretical analysis of a recently-published blurring-based feature selection algorithm and developed a new feature selection algorithm. We tested this algorithm on a wider set of test problems than in the comparable study in order to identify the benefits and limitations of blurring-based feature selection. These results provide theoretical and computational insight into entropy-based induction measures and feature selection algorithms.
Keywords :
entropy; feature extraction; pattern classification; C4.5 algorithm; blurring-based feature selection algorithm; class-attribute mutual information; conditional entropy; decision tree induction algorithms; entropy-based classification measures; entropy-based induction measures; relative performance; Algorithm design and analysis; Classification tree analysis; Data mining; Decision trees; Entropy; Humans; Information systems; Mutual information; Particle measurements; System testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System Sciences, 2001. Proceedings of the 34th Annual Hawaii International Conference on
Conference_Location :
Maui, HI, USA
Print_ISBN :
0-7695-0981-9
Type :
conf
DOI :
10.1109/HICSS.2001.926307
Filename :
926307
Link To Document :
بازگشت