DocumentCode
3117108
Title
A hybrid filter/wrapper approach of feature selection for gene expression data
Author
Ke, Chao-hsuan ; Yang, Cheng-Hong ; Chuang, Li-Yeh ; Yang, Cheng-San
Author_Institution
Dept. of Electron. Eng., Nat. Kaohsiung Univ. of Appl. Sci., Kaohsiung
fYear
2008
fDate
12-15 Oct. 2008
Firstpage
2664
Lastpage
2670
Abstract
In recent years, many studies have shown that microarray gene expression data is useful for disease identification and cancer classification. However, since gene expression data may contain thousands of genes simultaneously, successful microarray classification can be rather difficult. Feature (gene) selection is a frequently used pre-processing technology for successful classification of microarray gene expression data. Selecting a useful gene subset as a classifier not only decreases the computational time and cost, but also increases the classification accuracy. It is therefore imperative to extract only a small number of genes, which are exclusively relevant for the classification of a particular cancer/disease type. In this paper, correlation-based binary particle swarm optimizations is proposed to select the relevant genes, and a K-nearest neighbor with the leave-one-out cross-validation method serves as a classifier to evaluate the classification performance on six published cancer classification data sets. The experimental results show that the proposed method selects fewer gene subsets, while still resulting in higher prediction accuracy than the other literature methods.
Keywords
diseases; feature extraction; filtering theory; genetics; medical computing; particle swarm optimisation; pattern classification; K-nearest neighbor; cancer classification; cancer classification data sets; computational cost; computational time; correlation-based binary particle swarm optimizations; disease identification; feature selection; hybrid filter-wrapper approach; leave-one-out cross-validation method; microarray classification; microarray gene expression data; Cybernetics; Filters; Gene expression; Sliding mode control; classification; feature selection; particle swarm optimization;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on
Conference_Location
Singapore
ISSN
1062-922X
Print_ISBN
978-1-4244-2383-5
Electronic_ISBN
1062-922X
Type
conf
DOI
10.1109/ICSMC.2008.4811698
Filename
4811698
Link To Document