DocumentCode :
2223873
Title :
Multi-objective evolutionary algorithm for biclustering in microarrays data
Author :
Seridi, Khedidja ; Jourdan, Laetitia ; Talbi, El-Ghazali
Author_Institution :
LIFL, INRIA Lille-Nord Eur., Villeneuve-d´´Ascq, France
fYear :
2011
fDate :
5-8 June 2011
Firstpage :
2593
Lastpage :
2599
Abstract :
Microarrays are a powerful tool in studying genes expressions under several conditions. The obtained data need to be analyzed using data mining methods. Biclustering is a data mining method which consists in simultaneous clustering of rows and columns in a data matrix. Using biclustering, we can extract genes that have similar behavior (co-express) under specific conditions. These genes may share identical biological functions. The aim in analyzing gene expression data is the extraction of maximal number of genes and conditions that present similar behavior. The two objectives to be optimized (size and similarity) are conflicting. Therefore, multi-objective optimization is suitable for biclustering. In our work, we combine a well-known multi-objective genetic algorithm (NSGA-II) with a heuristic to solve the biclutering problem. Due to the huge size of the datasets, we use a string of integers as a solution representation where integers represent the indexes of the rows and the columns. Experimental results on real data set show that our approach can find significant biclusters of high quality.
Keywords :
biology computing; data mining; genetic algorithms; pattern clustering; biclustering problem; data matrix; data mining methods; gene expression data; microarrays data; multiobjective evolutionary algorithm; multiobjective genetic algorithm; multiobjective optimization; Approximation methods; Data mining; Evolutionary computation; Genetic expression; Humans; Optimization; Search problems; Biclustering; Mi-croarray data; Multi-objective optimization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation (CEC), 2011 IEEE Congress on
Conference_Location :
New Orleans, LA
ISSN :
Pending
Print_ISBN :
978-1-4244-7834-7
Type :
conf
DOI :
10.1109/CEC.2011.5949941
Filename :
5949941
Link To Document :
بازگشت