• DocumentCode
    2427794
  • Title

    An Improved Attribute Selection Measure for Decision Tree Induction

  • Author

    Wang, Dianhong ; Jiang, Liangxiao

  • Author_Institution
    China Univ. of Geosciences, Wuhan
  • Volume
    4
  • fYear
    2007
  • fDate
    24-27 Aug. 2007
  • Firstpage
    654
  • Lastpage
    658
  • Abstract
    Decision tree learning is one of the most widely used and practical methods for inductive inference. A fundamental issue in it is the attribute selection measure. The information gain measure is the most popular one for addressing this issue. However, a notable disadvantage of it is that it is biased towards selecting attributes with many values. Motivated by this fact, the gain ratio measure penalizes the attributes with many values by incorporating a term called split information. Unfortunately, the gain ratio measure suffers from another inevitable practical issue that the denominator sometimes is zero or very small. In this paper, we single out an improved attribute selection measure called average gain, which penalizes the attributes with many values by dividing the number of attribute values. We experimentally tested its effectiveness using 36 UCI data sets.
  • Keywords
    decision trees; inference mechanisms; learning (artificial intelligence); attribute selection measure; average gain; decision tree inductive learning; inductive inference; split information; Computer science; Decision trees; Equations; Fuzzy systems; Gain measurement; Geology; Performance gain; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2874-8
  • Type

    conf

  • DOI
    10.1109/FSKD.2007.161
  • Filename
    4406468