DocumentCode :
1902226
Title :
Feature Selection with Interactions for Continuous Attributes and Discrete Class
Author :
Mejía-Lavalle, Manuel ; Rodríguez, Guillermo
Author_Institution :
Inst. de Investigaciones Electricas, Cuernavaca
fYear :
2007
fDate :
25-28 Sept. 2007
Firstpage :
318
Lastpage :
323
Abstract :
Nowadays there exist diverse feature selection ranking methods and metrics for databases with pure discrete data (attributes and class), or pure continuous data. However, little work has been done for the case of continuous attributes with discrete class, and at the same time evaluating attribute subsets considering its inter- dependencies or interactions. Normally what we can do is perform discretization, and then apply some traditional feature selection method; nevertheless the results vary depending on the discretization method that we utilized. Additionally, if we only evaluate isolated attributes, we probably obtain poor results, because we are not considering attribute inter-dependencies. We propose a metric and method for feature selection on continuous data with discrete class, inspired in the Shannon´s entropy and the Information Gain, which overcomes the above problems. In the experiments that we realized, with synthetic and real databases, the proposed method has shown to be fast and produce near optimum solutions, selecting few attributes.
Keywords :
database theory; entropy; Shannon entropy; attribute subsets; continuous attributes; databases; discrete class; discretization method; feature selection; feature selection ranking methods; information gain; Accuracy; Automotive engineering; Data mining; Entropy; Filters; Prediction algorithms; Predictive models; Robots; Spatial databases; Supervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronics, Robotics and Automotive Mechanics Conference, 2007. CERMA 2007
Conference_Location :
Morelos
Print_ISBN :
978-0-7695-2974-5
Type :
conf
DOI :
10.1109/CERMA.2007.4367706
Filename :
4367706
Link To Document :
بازگشت