• DocumentCode
    3140736
  • Title

    Comparison of two families of entropy-based classification measures with and without feature selection

  • Author

    Giles, Kendall ; Bryson, Kweku-Muata ; Weng, Qin

  • Author_Institution
    Dept. of Inf. Syst., Virginia Commonwealth Univ., Richmond, VA, USA
  • fYear
    2001
  • fDate
    6-6 Jan. 2001
  • Abstract
    Many decision tree (DT) induction algorithms, including the popular C4.5, are based on the conditional entropy (CE) measure. An interesting question involves the relative performance of other entropy measures, such as class-attribute mutual information (CAMI). We therefore conducted a theoretical analysis of CAMI that enabled us to expose relationships with CE and correct a previous CAMI result. Our computational study showed that there was only a small variation in the performance of the two measures. Since feature selection is important in DT induction, we conducted a theoretical analysis of a recently-published blurring-based feature selection algorithm and developed a new feature selection algorithm. We tested this algorithm on a wider set of test problems than in the comparable study in order to identify the benefits and limitations of blurring-based feature selection. These results provide theoretical and computational insight into entropy-based induction measures and feature selection algorithms.
  • Keywords
    entropy; feature extraction; pattern classification; C4.5 algorithm; blurring-based feature selection algorithm; class-attribute mutual information; conditional entropy; decision tree induction algorithms; entropy-based classification measures; entropy-based induction measures; relative performance; Algorithm design and analysis; Classification tree analysis; Data mining; Decision trees; Entropy; Humans; Information systems; Mutual information; Particle measurements; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System Sciences, 2001. Proceedings of the 34th Annual Hawaii International Conference on
  • Conference_Location
    Maui, HI, USA
  • Print_ISBN
    0-7695-0981-9
  • Type

    conf

  • DOI
    10.1109/HICSS.2001.926307
  • Filename
    926307