DocumentCode :
2441964
Title :
Feature selection increases cross-validation imprecision
Author :
Xiao, Yufei ; Hua, Jianping ; Dougherty, Edward R.
Author_Institution :
Dept. of Electr. Eng., Texas A&M Univ., College Station, TX
fYear :
2006
fDate :
28-30 May 2006
Firstpage :
17
Lastpage :
18
Abstract :
Even without feature selection, cross-validation error estimation is problematic for small samples owing to the high variance of the deviation distribution describing the difference between the estimated and true errors. This paper investigates the increased loss of cross-validation precision owing to feature selection by comparing deviation distributions and introducing two variation-based measures to quantify the further degradation in performance.
Keywords :
biology computing; estimation theory; feature extraction; genetics; sampling methods; statistical distributions; cross-validation error estimation; cross-validation imprecision; feature selection; genomics; high variance deviation distribution; Bioinformatics; Degradation; Distributed computing; Error analysis; Gaussian distribution; Genomics; Linear discriminant analysis; Loss measurement; Performance loss; Signal processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Genomic Signal Processing and Statistics, 2006. GENSIPS '06. IEEE International Workshop on
Conference_Location :
College Station, TX
Print_ISBN :
1-4244-0384-7
Electronic_ISBN :
1-4244-0385-5
Type :
conf
DOI :
10.1109/GENSIPS.2006.353134
Filename :
4161755
Link To Document :
بازگشت