DocumentCode :
3123735
Title :
Exploiting Domain Knowledge to Improve Biological Significance of Biclusters with Key Missing Genes
Author :
Chen, Jin ; Ji, Liping ; Hsu, Wynne ; Tan, Kian-Lee ; Rhee, Seung Y.
Author_Institution :
Dept. of Plant Biol., Carnegie Instn. for Sci., Stanford, CA
fYear :
2009
fDate :
March 29 2009-April 2 2009
Firstpage :
1219
Lastpage :
1222
Abstract :
In an era of increasingly complex biological datasets, one of the key steps in gene functional analysis comes from clustering genes based on co-expression. Biclustering algorithms can identify gene clusters with local co-expressed patterns, which are more likely to define genes functioning together than global clustering methods. However, these algorithms are not effective in uncovering gene regulatory networks because the mined biclusters lack genes that may be critical in the function but may not be co-expressed with the clustered genes. In this paper, we introduce a biclustering method called skeleton biclustering (SKB), which builds high quality biclusters from microarray data, creates relationships among the biclustered genes based on gene ontology annotations, and identifies genes that are missing in the biclusters. SKB thus defines inter-bicluster and intra-bicluster functional relationships. The delineation of functional relationships and incorporation of such missing genes may help biologists to discover biological processes that are important in a given study and provides clues for how the processes may be functioning together. Experimental results show that, with SKB, the biological significance of the biclusters is considerably improved.
Keywords :
biology computing; genetics; ontologies (artificial intelligence); biclustering algorithms; biological datasets; gene expression clustering; gene functional analysis; gene ontology annotations; gene regulatory networks; key missing genes; microarray data; skeleton biclustering; Biology; Clustering algorithms; Clustering methods; Computer science; Data engineering; Functional analysis; Genetics; Ontologies; Plants (biology); Skeleton; clustering; gene expression; gene ontology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
ISSN :
1084-4627
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
Type :
conf
DOI :
10.1109/ICDE.2009.205
Filename :
4812505
Link To Document :
بازگشت