Title :
Gene selection in cancer classification using PSO/SVM and GA/SVM hybrid algorithms
Author :
Alba, Enrique ; García-Nieto, José ; Jourdan, Laetitia ; Talbi, El-Ghazali
Author_Institution :
Univ. de Malaga, Malaga
Abstract :
In this work we compare the use of a particle swarm optimization (PSO) and a genetic algorithm (GA) (both augmented with support vector machines SVM) for the classification of high dimensional microarray data. Both algorithms are used for finding small samples of informative genes amongst thousands of them. A SVM classifier with 10- fold cross-validation is applied in order to validate and evaluate the provided solutions. A first contribution is to prove that PSOsvm is able to find interesting genes and to provide classification competitive performance. Specifically, a new version of PSO, called Geometric PSO, is empirically evaluated for the first time in this work using a binary representation in Hamming space. In this sense, a comparison of this approach with a new GAsvm and also with other existing methods of literature is provided. A second important contribution consists in the actual discovery of new and challenging results on six public datasets identifying significant in the development of a variety of cancers (leukemia, breast, colon, ovarian, prostate, and lung).
Keywords :
cancer; genetic algorithms; genetics; medical image processing; particle swarm optimisation; support vector machines; GA/SVM hybrid algorithms; Hamming space; PSO/SVM hybrid algorithms; cancer classification; gene selection; genetic algorithm; geometric PSO; informative genes; microarray data; particle swarm optimization; support vector machines; Cancer; DNA; Filters; Gene expression; Genetic algorithms; Neoplasms; Particle swarm optimization; Space exploration; Support vector machine classification; Support vector machines;
Conference_Titel :
Evolutionary Computation, 2007. CEC 2007. IEEE Congress on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-1339-3
Electronic_ISBN :
978-1-4244-1340-9
DOI :
10.1109/CEC.2007.4424483