Title :
A New gene selection approach based on Minimum Redundancy-Maximum Relevance (MRMR) and Genetic Algorithm (GA)
Author :
Akadi, A. El ; Amine, A. ; Ouardighi, A. El ; Aboutajdine, D.
Author_Institution :
GSCM-LRIT, Univ. of Mohammed V, Rabat
Abstract :
Gene expression data usually contains a large number of genes (several thousand or more) but a small number of samples (usually <100). Among all the genes, many are irrelevant, insignificant or redundant to the discriminant problem under investigation. Hence the identification of informative genes, which have the greatest power for classification, is of fundamental and practical importance to the investigation of specific discriminant problems, such as cancer versus non-cancer or different tumor types. In this paper, we propose a two-stage selection algorithm by combining minimum redundancy maximum relevance (MRMR) and genetic algorithm (GA). In the first stage, MRMR is used to filter noisy and redundant genes in high dimensional microarray data. In the second stage, GA and SVM work together for selecting the highly discriminating genes. The proposed method is tested on five open datasets: NCI, Lymphoma, Lung, Leukemia and Colon. In the experiments, we compare our proposed MRMR-GA algorithm with GA-SVM wrapper and MRMR filter. The experimental results show that the proposed method has excellent selection and classification performances.
Keywords :
bioinformatics; cancer; feature extraction; genetic algorithms; genetics; pattern classification; support vector machines; tumours; SVM; cancer; gene expression data; gene selection approach; genetic algorithm; high dimensional microarray data; informative gene identification; minimum redundancy-maximum relevance method; pattern classification; tumor; Cancer; Colon; Diversity reception; Filters; Gene expression; Genetic algorithms; Lungs; Neoplasms; Support vector machines; Testing;
Conference_Titel :
Computer Systems and Applications, 2009. AICCSA 2009. IEEE/ACS International Conference on
Conference_Location :
Rabat
Print_ISBN :
978-1-4244-3807-5
Electronic_ISBN :
978-1-4244-3806-8
DOI :
10.1109/AICCSA.2009.5069306