Title :
Fast Feature Selection Method for Continuous Attributes with Nominal Class
Author :
Mejía-Lavalle, Manuel ; Morales, Eduardo F. ; Rodríguez, Guillermo
Author_Institution :
Instituto de Investigaciones Electricas, Mexico
Abstract :
Feature selection has become a relevant pre-processing problem on knowledge discovery in databases, because of very large databases or because some attributes are expensive to obtain. There is a large number of diverse feature selection methods for databases with pure nominal data (attributes and class), or pure continuous data, but little work has been done for the case of continuous attributes with nominal class. Normally what we can do is perform discretization, and then apply some traditional feature selection method; however the results can vary greatly depending on the discretization method used. We propose a direct method for feature selection on continuous data with nominal class, inspired in the Shannon¿s entropy and an Information Gain measure. In the experiments that we realized, with synthetic and real databases, the proposed method has shown to be fast and to produce very competitive solutions with a small set of attributes
Keywords :
Accuracy; Artificial intelligence; Data mining; Entropy; Filters; Gain measurement; Prediction algorithms; Predictive models; Spatial databases; Supervised learning;
Conference_Titel :
Artificial Intelligence, 2006. MICAI '06. Fifth Mexican International Conference on
Conference_Location :
Mexico City, Mexico
Print_ISBN :
0-7695-2722-1
DOI :
10.1109/MICAI.2006.14