Title :
Incorporating Protein-Protein Interactions Knowledge in Clustering Gene Expression Data
Author :
Li, Gangguo ; Wang, Zhengzhi
Author_Institution :
Inst. of Autom., Nat. Univ. of Defense Technol., Changsha
Abstract :
In this paper, a similarity measure between genes with protein-protein interactions is proposed. On the basis of it, the combined dissimilarity measure is defined. The combined distance measure is introduced into K-means method, which can be considered as an improved K-means method. The improved K-means method and other three clustering methods are evaluated by a real dataset. Performance of these methods is assessed by a prediction accuracy analysis through known gene annotations. Our results show that the improved K-means method outperforms other clustering methods. The performance of the improved K-means method is also tested by varying the tuning parameter of the combined dissimilarity measure. The results show that when the tuning parameter decreases, the performance increases. Finally, a framework of integration of various biological prior knowledge and gene expression data is proposed.
Keywords :
biology computing; genetics; molecular biophysics; pattern clustering; proteins; sensor fusion; K-means method; clustering gene expression data; protein-protein interactions; tuning parameter; Automation; Bioinformatics; Clustering algorithms; Clustering methods; Databases; Gene expression; Genetics; Genomics; Partitioning algorithms; Proteins;
Conference_Titel :
Bioinformatics and Biomedical Engineering, 2008. ICBBE 2008. The 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-1747-6
Electronic_ISBN :
978-1-4244-1748-3
DOI :
10.1109/ICBBE.2008.56