DocumentCode :
1470658
Title :
Feature selection via discretization
Author :
Liu, Huan ; Setiono, Rudy
Author_Institution :
Dept. of Inf. Syst. & Comput. Sci., Nat. Univ. of Singapore, Singapore
Volume :
9
Issue :
4
Year :
1997
Firstpage :
642
Lastpage :
645
Abstract :
Discretization can turn numeric attributes into discrete ones. Feature selection can eliminate some irrelevant and/or redundant attributes. Chi2 is a simple and general algorithm that uses the χ² statistic to discretize numeric attributes repeatedly until some inconsistencies are found in the data. It achieves feature selection via discretization. It can handle mixed attributes, work with multiclass data, and remove irrelevant and redundant attributes.
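The χ²-driven merging mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it shows only a ChiMerge-style step in which the adjacent pair of intervals with the lowest χ² value is merged while that value stays below a threshold. The function names, the fixed `threshold` parameter, and the choice to skip cells whose expected count is zero are assumptions made for illustration, and the inconsistency check that stops Chi2 is not modelled here.

```python
from typing import List, Sequence


def chi2_adjacent(a: Sequence[int], b: Sequence[int]) -> float:
    """Chi-square statistic for two adjacent intervals.

    a[j] and b[j] are the numbers of examples of class j that fall
    into the first and second interval respectively.
    """
    if len(a) != len(b):
        raise ValueError("both intervals need one count per class")
    rows = (list(a), list(b))
    row_totals = [sum(r) for r in rows]
    col_totals = [x + y for x, y in zip(a, b)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(rows):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n if n else 0.0
            if expected == 0.0:
                # Assumption for this sketch: cells with a zero
                # expected count are skipped.
                continue
            stat += (observed - expected) ** 2 / expected
    return stat


def merge_intervals(counts: Sequence[Sequence[int]],
                    threshold: float) -> List[List[int]]:
    """Repeatedly merge the adjacent pair with the lowest chi-square
    value, as long as that value is below `threshold` (hypothetical
    stopping rule; Chi2 itself stops on data inconsistency)."""
    merged = [list(c) for c in counts]
    while len(merged) > 1:
        stats = [chi2_adjacent(merged[i], merged[i + 1])
                 for i in range(len(merged) - 1)]
        i = min(range(len(stats)), key=stats.__getitem__)
        if stats[i] >= threshold:
            break
        merged[i] = [x + y for x, y in zip(merged[i], merged[i + 1])]
        del merged[i + 1]
    return merged
```

Under these assumptions, an attribute whose intervals all collapse into a single interval no longer separates the classes, which is the intuition behind treating discretization as a feature-selection step.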
Keywords :
data handling; feature extraction; learning (artificial intelligence); pattern classification; Chi2; chi2 statistic; discretization; feature selection; general algorithm; inconsistencies; mixed attributes; multiclass data; numeric attributes; redundant attribute removal; redundant attributes; Accuracy; Classification algorithms; Computer science; Information systems; Merging; Notice of Violation; Remuneration; Statistics; Training data
Language :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
IEEE
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.617056
Filename :
617056