DocumentCode :
2847508
Title :
Mining cross-graph quasi-cliques in gene expression and protein interaction data
Author :
Pei, Jian ; Jiang, Daxin ; Zhang, Aidong
Author_Institution :
Simon Fraser Univ., Burnaby, BC, Canada
fYear :
2005
fDate :
5-8 April 2005
Firstpage :
353
Lastpage :
356
Abstract :
A protein is the product of a gene. From gene expression data, we can find co-expressed genes, which are groups of genes that demonstrate coherent patterns across samples. From protein interaction data, on the other hand, we can find groups of proteins that frequently interact with each other. If we mine gene expression data and protein interaction data jointly, we may find clusters of genes that are co-expressed and whose proteins also interact. Clusters found by such joint mining are interesting and meaningful for at least two reasons. First, both the gene expression data and the protein interaction data are very noisy. Clusters confirmed by both data sets strongly indicate a correlation or connection among the genes in a cluster. In other words, the clusters found by joint mining are more reliable. We may thus have high confidence that the genes in such a cluster are regulated by the same mechanism or belong to the same biological process. Second, although highly related, gene expression data and protein interaction data still carry different biological meanings. The coincidence of co-expressed genes and interacting proteins is biologically significant. As indicated in [5], many pathways exhibit two properties: their genes exhibit similar gene expression profiles, and the protein products of the genes often interact.
Keywords :
biology computing; data mining; database management systems; genetics; pattern clustering; proteins; gene expression data; joint mining; protein; protein interaction data; Biological processes; Biological system modeling; Euclidean distance; Gene expression; Loss measurement; Proteins;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
ISSN :
1084-4627
Print_ISBN :
0-7695-2285-8
Type :
conf
DOI :
10.1109/ICDE.2005.87
Filename :
1410140