• DocumentCode
    1462028
  • Title

    A new criterion in selection and discretization of attributes for the generation of decision trees

  • Author

    Jun, Byung Hwan ; Kim, Chang Soo ; Song, Hong-Yeop ; Kim, Jaihie

  • Author_Institution
    Dept. of Comput. Sci., Kongju Nat. Univ., Chungnam, South Korea
  • Volume
    19
  • Issue
    12
  • fYear
    1997
  • fDate
    12/1/1997 12:00:00 AM
  • Firstpage
    1371
  • Lastpage
    1375
  • Abstract
    It is important to use a better criterion in selection and discretization of attributes for the generation of decision trees to construct a better classifier in the area of pattern recognition in order to intelligently access huge amount of data efficiently. Two well-known criteria are gain and gain ratio, both based on the entropy of partitions. We propose in this paper a new criterion based also on entropy, and use both theoretical analysis and computer simulation to demonstrate that it works better than gain or gain ratio in a wide variety of situations. We use the usual entropy calculation where the base of the logarithm is not two but the number of successors to the node. Our theoretical analysis leads some specific situations in which the new criterion works always better than gain or gain ratio, and the simulation result may implicitly cover all the other situations not covered by the analysis
  • Keywords
    decision theory; entropy; pattern classification; trees (mathematics); attribute discretization; attribute selection; classifier; decision tree generation; partition entropy; pattern recognition; Analytical models; Classification tree analysis; Computer Society; Computer simulation; Decision trees; Entropy; Error analysis; Gain measurement; Machine learning; Pattern recognition;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/34.643896
  • Filename
    643896