• DocumentCode
    1470658
  • Title

    Feature selection via discretization

  • Author

    Liu, Huan ; Setiono, Rudy

  • Author_Institution
    Dept. of Inf. Syst. & Comput. Sci., Nat. Univ. of Singapore, Singapore
  • Volume
    9
  • Issue
    4
  • fYear
    1997
  • Firstpage
    642
  • Lastpage
    645
  • Abstract
    Discretization can turn numeric attributes into discrete ones. Feature selection can eliminate some irrelevant and/or redundant attributes. Chi2 is a simple and general algorithm that uses the χ 2 statistic to discretize numeric attributes repeatedly until some inconsistencies are found in the data. It achieves feature selection via discretization. It can handle mixed attributes, work with multiclass data, and remove irrelevant and redundant attributes
  • Keywords
    data handling; feature extraction; learning (artificial intelligence); pattern classification; Chi2; chi2 statistic; discretization; feature selection; general algorithm; inconsistencies; mixed attributes; multiclass data; numeric attributes; pattern classification; redundant attribute removal; redundant attributes; Accuracy; Classification algorithms; Computer science; Information systems; Merging; Notice of Violation; Pattern classification; Remuneration; Statistics; Training data;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.617056
  • Filename
    617056