DocumentCode :
3275685
Title :
Modeling biclustering as an optimization problem using mutual information
Author :
Gupta, Neelima ; Aggarwal, Seema
Author_Institution :
Dept. of Comput. Sci., Univ. of Delhi, Delhi, India
fYear :
2009
fDate :
14-15 Dec. 2009
Firstpage :
1
Lastpage :
5
Abstract :
Most biclustering algorithms for the analysis of high-dimensional gene expression data use a distance measure or a correlation coefficient between a pair of genes as the similarity measure. These measures capture only linear relationships between genes, but nonlinear relationships may also exist among them. Mutual information is a more general measure that captures positive and negative correlation as well as nonlinear relationships. The biclustering problem has been modeled as an optimization problem, and a polynomial-time solution has been proposed for it. In this paper we use mutual information to model biclustering as an optimization problem. Results on gene expression data of Arabidopsis thaliana show that our method produces different yet biologically significant biclusters. Our biclusters had better p-values than the biclusters obtained by other existing algorithms. In addition, the promoter regions of the genes in most of the biclusters were found to share common motif patterns.
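The contrast the abstract draws between correlation and mutual information can be illustrated with a small sketch. It is not the authors' method: the histogram (plug-in) estimator, the bin count of 8, the function names, and the synthetic profiles are all assumptions chosen for illustration. It compares the Pearson correlation with an estimated mutual information for a linear, a negatively correlated, and a quadratic relationship.

```python
import numpy as np


def pearson(x, y):
    """Pearson correlation coefficient between two profiles."""
    return float(np.corrcoef(x, y)[0, 1])


def mutual_information(x, y, bins=8):
    """Histogram (plug-in) estimate of I(X;Y) in bits.

    Assumption: equal-width binning with a hypothetical default of 8 bins;
    the record does not state which estimator the paper uses.
    """
    joint, _, _ = np.histogram2d(x, y, bins=bins)
    pxy = joint / joint.sum()              # joint distribution
    px = pxy.sum(axis=1, keepdims=True)    # marginal of x, shape (bins, 1)
    py = pxy.sum(axis=0, keepdims=True)    # marginal of y, shape (1, bins)
    nz = pxy > 0                           # skip empty cells (0 * log 0 = 0)
    return float(np.sum(pxy[nz] * np.log2(pxy[nz] / (px @ py)[nz])))


# Synthetic stand-ins for expression profiles (not real data).
x = np.linspace(-1.0, 1.0, 401)
y_linear = 2.0 * x + 1.0   # positive linear relationship
y_neg = -x                 # negative linear relationship
y_quad = x ** 2            # nonlinear relationship, symmetric about 0

for name, y in [("linear", y_linear), ("negative", y_neg), ("quadratic", y_quad)]:
    print(f"{name:9s} pearson={pearson(x, y):+.3f}  MI={mutual_information(x, y):.3f} bits")
```

In this sketch the quadratic profile has a Pearson correlation close to zero yet a clearly positive estimated mutual information, while both linear cases score high on either measure (in absolute value for the correlation). Histogram estimates depend on the bin count and sample size, so the numbers are only indicative.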
Keywords :
biology computing; genetics; optimisation; pattern clustering; Arabidopsis Thaliana; biclustering optimization problem; biologically significant biclusters; common motif patterns; correlation coefficient; distance measure; high dimensional gene expression data; mutual information; negative correlation; nonlinear relationship; polynomial time solution; positive correlation; Background noise; Biological information theory; Biological system modeling; Clustering algorithms; Computer science; Data mining; Gene expression; Mutual information; Noise measurement; Optimization methods; Biclustering; GO term and Transcription factor binding site; Gene Expression Data; Mutual Information;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Methods and Models in Computer Science, 2009. ICM2CS 2009. Proceeding of International Conference on
Conference_Location :
Delhi
Print_ISBN :
978-1-4244-5051-0
Type :
conf
DOI :
10.1109/ICM2CS.2009.5397969
Filename :
5397969