DocumentCode :
2789958
Title :
Experimental analysis of feature selection stability for high-dimension and low-sample size gene expression classification task
Author :
Dernoncourt, D. ; Hanczar, B. ; Zucker, Jean-Daniel
Author_Institution :
Centre de Rech. des Cordeliers, Inst. Nat. de la Sante et de la Rech. Medicale, Paris, France
fYear :
2012
fDate :
11-13 Nov. 2012
Firstpage :
350
Lastpage :
355
Abstract :
Gene selection is a crucial step when building a classifier from microarray or metagenomic data. Because the number of observations is small, gene selection tends to be unstable: two gene subsets obtained from different datasets for the same classification problem commonly do not overlap significantly. Although this is a crucial problem, few studies have addressed selection stability. In this paper, we first present several stability quantification methods, then study how those measures, along with the resulting classification performance, vary with several parameters (dimensionality, sample size, feature distribution, selection threshold) on both artificial and real data. Feature selection was performed with a t-test and classification with linear discriminant analysis. We point out a strong empirical correlation between the dimensionality/sample size ratio and selection instability.
Keywords :
pattern classification; stability; statistical analysis; artificial data; dimensionality/sample size ratio; empiric correlation; feature selection; feature selection stability; gene selection; gene subsets; high-dimension gene expression classification task; linear discriminant analysis; low-sample size gene expression classification task; metagenomic data; microarray data; real data; selection instability; stability quantification methods; t-test; Correlation; Error analysis; Indexes; Size measurement; Stability criteria; Training; Feature selection; dimensionality/sample size ratio; small sample; stability;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Bioinformatics & Bioengineering (BIBE), 2012 IEEE 12th International Conference on
Conference_Location :
Larnaca
Print_ISBN :
978-1-4673-4357-2
Type :
conf
DOI :
10.1109/BIBE.2012.6399649
Filename :
6399649