DocumentCode :
423706
Title :
An input variable importance definition based on empirical data probability and its use in variable selection
Author :
Lemaire, Vincent ; Clérot, Fabrice
Author_Institution :
DTL, France Telecom Res. & Dev., Lannion, France
Volume :
2
fYear :
2004
fDate :
25-29 July 2004
Firstpage :
1375
Abstract :
Variable and feature selection have become the focus of much research in areas of application for which datasets with tens or hundreds of thousands of variables are available. We propose a new method to score subsets of variables according to their usefulness for the performance of a given model. This method is applicable on every kind of model and on classification or regression task. We assess the efficiency of the method with our results on the NIPS 2003 feature selection challenge and with an example of a real application.
Keywords :
error statistics; feature extraction; neural nets; pattern classification; probability; regression analysis; set theory; NIPS 2003 feature selection; artificial neural networks; classification task; empirical data probability; error statistics; regression task; variable subset selection; Filters; Input variables; Performance evaluation; Predictive models; Research and development; Spatial databases; Telecommunications; Warehousing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN :
1098-7576
Print_ISBN :
0-7803-8359-1
Type :
conf
DOI :
10.1109/IJCNN.2004.1380149
Filename :
1380149
Link To Document :
بازگشت