DocumentCode :
3186813
Title :
Filtering for improved gene selection on microarray data
Author :
Canul-Reich, Juana ; Hall, Lawrence O. ; Goldgof, Dmitry ; Eschrich, Steven A.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
fYear :
2010
fDate :
10-13 Oct. 2010
Firstpage :
3250
Lastpage :
3257
Abstract :
Many genes and a small number of samples are problematic characteristics of microarray datasets. We investigated the impact on classification accuracy of gene selection approaches on filtered-to-200-gene datasets. Four datasets were used with 3 filters: Student´s t-test, information gain, and reliefF. We applied Iterative Feature Perturbation (IFP) and Recursive Feature Elimination (SVM-RFE) for further gene selection. Both approaches resulted in accuracy improvement when used with the t-test-filtered datasets, but not when information gain or reliefF were used. An AUC analysis of the IFP and SVM-RFE accuracy curves indicated that both methods reached the highest AUC values after t-test filtering. A statistical analysis of accuracy across the best 50 genes using the Friedman/Holm test showed that IFP and SVM-RFE were significantly more accurate more often when applied to the t-test-filtered gene sets. Surprisingly, the simple t-test, applied as a filter, results in the best overall SVM accuracy and is at least as accurate as the other, more complicated filter methods.
Keywords :
arrays; biology computing; feature extraction; genetics; molecular biophysics; recursive filters; statistical analysis; support vector machines; Friedman/Holm test; IFP; SVM-RFE; gene selection approach; iterative feature perturbation; microarray datasets; recursive feature elimination; statistical analysis; t-test filtering; Accuracy; Support vector machines; Variable speed drives; Feature selection; IFP; Information gain; ReliefF; SVM-RFE; microarray data; t-test;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1062-922X
Print_ISBN :
978-1-4244-6586-6
Type :
conf
DOI :
10.1109/ICSMC.2010.5642302
Filename :
5642302
Link To Document :
بازگشت