• DocumentCode
    745273
  • Title

    A modified Chi2 algorithm for discretization

  • Author

    Tay, Francis E H ; Shen, Lixiang

  • Author_Institution
    Dept. of Mech. Eng., Nat. Univ. of Singapore, Singapore
  • Volume
    14
  • Issue
    3
  • fYear
    2002
  • Firstpage
    666
  • Lastpage
    670
  • Abstract
    Since the ChiMerge algorithm was first proposed by Kerber (1992), it has become a widely used and discussed discretization method. The Chi2 algorithm is a modification of the ChiMerge method. It automates the discretization process by introducing an inconsistency rate as the stopping criterion, and it automatically selects the significance value. In addition, it adds a finer phase aimed at feature selection to broaden the applications of the ChiMerge algorithm. However, the Chi2 algorithm does not consider the inaccuracy inherent in ChiMerge's merging criterion. The user-defined inconsistency rate also introduces inaccuracy into the discretization process. These two drawbacks are first discussed in this paper, and modifications to overcome them are then proposed. A comparison of results using C4.5 shows that the modified Chi2 algorithm performs better than the original Chi2 algorithm. It also becomes a completely automatic discretization method.
  • Keywords
    heuristic programming; learning (artificial intelligence); merging; C4.5 method; ChiMerge algorithm; discretization; heuristic method; inconsistency rate; machine learning; modified Chi2 algorithm; stopping criterion
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2002.1000349
  • Filename
    1000349
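
The abstract names two quantities that drive ChiMerge and Chi2: the chi-square statistic used as the merging criterion for adjacent intervals, and the inconsistency rate used as the stopping criterion. The sketch below illustrates both under the standard textbook definitions; it is not the paper's modified algorithm. The function names `chi2_adjacent` and `inconsistency_rate` are hypothetical, and skipping zero-expectation cells is an assumption made here for simplicity.

```python
from collections import Counter


def chi2_adjacent(counts_a, counts_b):
    """Pearson chi-square statistic for two adjacent intervals.

    counts_a and counts_b are per-class example counts (same class order).
    A low value suggests the class distributions are similar, so the two
    intervals are candidates for merging.
    """
    n = sum(counts_a) + sum(counts_b)
    chi2 = 0.0
    for row in (counts_a, counts_b):
        row_total = sum(row)
        for j, observed in enumerate(row):
            col_total = counts_a[j] + counts_b[j]
            expected = row_total * col_total / n
            if expected == 0:
                # Assumption: skip empty cells; some implementations
                # substitute a small constant instead.
                continue
            chi2 += (observed - expected) ** 2 / expected
    return chi2


def inconsistency_rate(rows, labels):
    """Fraction of examples that disagree with the majority class of
    the examples sharing the same discretized attribute values."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(tuple(row), Counter())[label] += 1
    inconsistent = sum(
        sum(c.values()) - max(c.values()) for c in groups.values()
    )
    return inconsistent / len(labels)
```

For example, `chi2_adjacent([5, 5], [5, 5])` is zero because the two intervals have identical class distributions, whereas intervals with completely different class distributions give a large value and would not be merged.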