Title :
Gene selection for cancer classification using bootstrapped genetic algorithms and support vector machines
Author_Institution :
Dept. of Electr. & Comput. Eng., California State Univ., Northridge, CA, USA
Abstract :
The gene expression data obtained from microarrays have shown useful in cancer classification. DNA microarray data have extremely high dimensionality compared to the small number of available samples. In this paper, we propose a novel system for selecting a set of genes for cancer classification. This system is based on a linear support vector machine and a genetic algorithm. To overcome the problem of the small size of training samples, bootstrap methods are combined into genetic search. Two databases are considered: the colon cancer database and the leukemia database. Our experimental results show that the proposed method is capable of finding genes that discriminate between normal cells and cancer cells and generalizes well.
Keywords :
DNA; arrays; biology computing; cancer; cellular biophysics; genetic algorithms; genetics; pattern classification; support vector machines; tumours; DNA microarray data; bootstrapped genetic algorithm; cancer classification; colon cancer database; gene expression data; gene selection; leukemia database; support vector machines; Cancer; Colon; DNA; Databases; Frequency; Gene expression; Genetic algorithms; Pharmaceutical technology; Support vector machine classification; Support vector machines;
Conference_Titel :
Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
Print_ISBN :
0-7695-2000-6
DOI :
10.1109/CSB.2003.1227389