Title :
Fast Gene Selection for Microarray Data Using SVM-Based Evaluation Criterion
Author :
Zhou, Xin ; Wu, X.Y. ; Mao, K.Z. ; Tuck, David P.
Author_Institution :
Dept. of Pathology, Yale Univ. Sch. of Med., New Haven, CT
Abstract :
An important application of microarrays is to identify the relevant genes, among thousands of genes, for phenotypic classification. The performance of a gene selection algorithm is often assessed in terms of both predictive capacity and computational efficiency, but the predictive capacity of the selected features receives more attention than computational efficiency does. In gene selection problems, however, computational efficiency is equally important because of the very high dimensionality of gene expression data. We propose an SVM-IRFS algorithm, which combines a Support Vector Machine (SVM) based criterion, the generalized $\|w\|^2$ measure, with a new search procedure named Iterative Reduced Forward Selection (IRFS) to address the gene selection problem. In IRFS, an adaptive threshold is used to screen out irrelevant feature subsets, so that unnecessary computations are avoided. The advantage of the proposed SVM-IRFS algorithm is twofold. First, its selection procedure is computationally very efficient: it can select tens of genes from thousands in several seconds. Second, benefiting from the good classification performance of support vector machines, SVM-IRFS produces a feature subset with high predictive capacity.
Keywords :
biology computing; genetics; genomics; iterative methods; molecular biophysics; pattern classification; support vector machines; SVM-IRFS algorithm; SVM-based evaluation criterion; computational efficiency; gene selection; iterative reduced forward selection; microarray data; phenotypic classification; predictive capacity; support vector machine; Bioinformatics; Biomedical engineering; Data engineering; Gene expression; Iterative algorithms; Pathology; Support vector machine classification; USA Councils; feature selection; microarray data analysis
Conference_Title :
Bioinformatics and Biomedicine, 2008. BIBM '08. IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-0-7695-3452-7
DOI :
10.1109/BIBM.2008.57