DocumentCode
424110
Title
Gene selection for leukemia subtype classification from gene expression profile
Author
Li, Ying-Xin ; Zhu, Yun-Hua ; Ruan, Xiao-gang
Author_Institution
Sch. of Electron. Inf. & Control Eng., Beijing Univ. of Technol., China
Volume
3
fYear
2004
fDate
26-29 Aug. 2004
Firstpage
1661
Abstract
It is very important but difficult to identify which genes in gene expression data can contribute most to tumor subtype classification. An approach to select a small subset of genes for leukemia subtype classification from large scale gene expression profile is proposed in this paper. Having removed the noisy genes with little relevance to the classification task, the "sequential floating forward search" method was employed to generate candidate feature subsets consisting of informative genes, and then, a support vector machine was employed as a classifier to select the optimal feature subset with minimum classification errors. The results of our experiment showed that all the samples can be correctly classified without any error with only five genes.
Keywords
cancer; feature extraction; genetics; pattern classification; support vector machines; tumours; classification errors; gene expression data; gene expression profile; gene selection; informative genes; leukemia subtype classification; optimal feature subset selection; sequential floating forward search method; support vector machine; tumor subtype classification; Cancer; Control engineering; Data analysis; Electronic mail; Gene expression; Large-scale systems; Neoplasms; Noise generators; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN
0-7803-8403-2
Type
conf
DOI
10.1109/ICMLC.2004.1382042
Filename
1382042
Link To Document