Title :
Finding Multiple Coherent Biclusters in Microarray Data Using Variable String Length Multiobjective Genetic Algorithm
Author :
Maulik, Ujjwal ; Mukhopadhyay, Anirban ; Bandyopadhyay, Sanghamitra
Author_Institution :
Dept. of Comput. Sci. & Eng., Jadavpur Univ., Kolkata, India
Abstract :
Microarray technology enables the simultaneous monitoring of the expression pattern of a huge number of genes across different experimental conditions. Biclustering in microarray data is an important technique that discovers a group of genes that are coregulated in a subset of conditions. Biclustering algorithms require to identify coherent and nontrivial biclusters, i.e., the biclusters should have low mean squared residue and high row variance. A multiobjective genetic biclustering technique is proposed here that optimizes these objectives simultaneously. A novel encoding scheme that uses variable chromosome length is developed. Moreover, a new quantitative measure to evaluate the goodness of the biclusters is proposed. The performance of the proposed algorithm has been evaluated on both simulated and real-life gene expression datasets, and compared with some other well-known biclustering techniques.
Keywords :
biology computing; genetic algorithms; genetics; biclustering algorithm; genetic biclustering technique; mean squared residue; microarray data; multiple coherent biclusters; variable chromosome length; variable string length multiobjective genetic algorithm; Biclustering; mean squared residue (MSR); multiobjective genetic algorithm (GA); row variance; variable string length; Algorithms; Cluster Analysis; Computational Biology; Computer Simulation; Databases, Genetic; Gene Expression Profiling; Humans; Leukemia; Models, Genetic; Oligonucleotide Array Sequence Analysis; Yeasts;
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
DOI :
10.1109/TITB.2009.2017527