Title :
Chi2: feature selection and discretization of numeric attributes
Author :
Liu, Huan ; Setiono, Rudy
Author_Institution :
Dept. of Inf. Syst. & Comput. Sci., Nat. Univ. of Singapore, Singapore
Abstract :
Discretization can turn numeric attributes into discrete ones. Feature selection can eliminate some irrelevant attributes. This paper describes Chi2, a simple and general algorithm that uses the χ² statistic to discretize numeric attributes repeatedly until some inconsistencies are found in the data, and achieves feature selection via discretization. The empirical results demonstrate that Chi2 is effective in feature selection and discretization of numeric and ordinal attributes.
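The abstract does not spell out how the χ² statistic is computed, so the following is only a minimal sketch of the standard Pearson χ² test applied to two adjacent intervals of a numeric attribute, as a χ²-based discretizer might use it when deciding whether to merge them. The function name `chi2_adjacent`, the `floor` parameter, and its default value of 0.1 are assumptions made for illustration and are not taken from this record.

```python
def chi2_adjacent(counts_a, counts_b, floor=0.1):
    """Pearson chi-square statistic for two adjacent intervals.

    counts_a, counts_b: per-class sample counts in each interval,
    listed in the same class order.
    floor: hypothetical substitute for an expected count of zero,
    used here only to avoid division by zero.
    """
    if len(counts_a) != len(counts_b):
        raise ValueError("both intervals must list the same classes")

    rows = [list(counts_a), list(counts_b)]
    row_totals = [sum(row) for row in rows]
    col_totals = [a + b for a, b in zip(counts_a, counts_b)]
    n = sum(row_totals)

    stat = 0.0
    for i, row in enumerate(rows):
        for j, observed in enumerate(row):
            # Expected count if class were independent of interval.
            expected = row_totals[i] * col_totals[j] / n if n else 0.0
            if expected == 0:
                expected = floor
            stat += (observed - expected) ** 2 / expected
    return stat


if __name__ == "__main__":
    print(chi2_adjacent([5, 5], [5, 5]))    # identical class distributions
    print(chi2_adjacent([10, 0], [0, 10]))  # perfectly separated classes
```

A low value suggests the two intervals have similar class distributions and are candidates for merging; a high value suggests the boundary between them carries class information. In this sketch, two intervals with identical counts score 0, while two perfectly separated intervals of ten samples each score 20.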
Keywords :
data handling; statistical analysis; uncertainty handling; Chi2; algorithm; data inconsistencies; discrete attributes; discretization; feature selection; irrelevant attributes; numeric attributes; ordinal attributes; Classification algorithms; Computer science; Equations; Frequency; Information systems; Remuneration; Statistics; Training data;
Conference_Title :
Tools with Artificial Intelligence, 1995. Proceedings., Seventh International Conference on
Conference_Location :
Herndon, VA
Print_ISBN :
0-8186-7312-5
DOI :
10.1109/TAI.1995.479783