DocumentCode
423706
Title
An input variable importance definition based on empirical data probability and its use in variable selection
Author
Lemaire, Vincent ; Clérot, Fabrice
Author_Institution
DTL, France Telecom Res. & Dev., Lannion, France
Volume
2
fYear
2004
fDate
25-29 July 2004
Firstpage
1375
Abstract
Variable and feature selection have become the focus of much research in areas of application for which datasets with tens or hundreds of thousands of variables are available. We propose a new method to score subsets of variables according to their usefulness for the performance of a given model. This method is applicable on every kind of model and on classification or regression task. We assess the efficiency of the method with our results on the NIPS 2003 feature selection challenge and with an example of a real application.
Keywords
error statistics; feature extraction; neural nets; pattern classification; probability; regression analysis; set theory; NIPS 2003 feature selection; artificial neural networks; classification task; empirical data probability; error statistics; regression task; variable subset selection; Filters; Input variables; Performance evaluation; Predictive models; Research and development; Spatial databases; Telecommunications; Warehousing;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN
1098-7576
Print_ISBN
0-7803-8359-1
Type
conf
DOI
10.1109/IJCNN.2004.1380149
Filename
1380149
Link To Document