DocumentCode :
3424731
Title :
New filter-based feature selection criteria for identifying differentially expressed genes
Author :
Loo, Lit-Hsin ; Roberts, Samuel ; Hrebien, Leonid ; Kam, Moshe
Author_Institution :
Bauer Center for Genomics Res., Harvard Univ., Cambridge, MA, USA
fYear :
2005
fDate :
15-17 Dec. 2005
Abstract :
We propose two new filter-based feature selection criteria for identifying differentially expressed genes, namely the average difference score (ADS) and the mean difference score (MDS). These criteria replace the serial noise estimator used in existing criteria by a parallel noise estimator. The result is better detection of changes in the variance of expression levels, which t-statistic type criteria tend to under-emphasize. We compare the performance of the new criteria to that of several commonly used feature selection criteria, including the Welch t-statistic, the Fisher correlation score, the Wilcoxon rank sum, and the independently consistent expression discriminator, on synthetic data and real biological data obtained from acute lymphoblastic leukemia and acute myeloid leukemia patients. We find that ADS and MDS outperform the other criteria by exhibiting higher sensitivity and comparable specificity. ADS is also able to flag several biologically important genes that are missed by the Welch t-statistic.
Keywords :
biology; feature extraction; filtering theory; statistical testing; Fisher correlation score; Welch t-statistic; Wilcoxon rank sum; average difference score; expressed genes; filter-based feature selection criteria; independently consistent expression discriminator; lymphoblastic leukemia; mean difference score; myeloid leukemia; parallel noise estimator; serial noise estimator; t-statistic type criteria; Biological processes; Biomarkers; Clinical diagnosis; DNA; Data models; Pathology; Power generation economics; Statistical analysis; Statistical distributions; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications, 2005. Proceedings. Fourth International Conference on
Print_ISBN :
0-7695-2495-8
Type :
conf
DOI :
10.1109/ICMLA.2005.48
Filename :
1607442
Link To Document :
بازگشت