DocumentCode :
2079509
Title :
Efficient mining from heterogeneous data sets for predicting protein-protein interactions
Author :
Mamitsuka, Hiroshi
Author_Institution :
Inst. for Chem. Res., Kyoto Univ., Uji, Japan
fYear :
2003
fDate :
1-5 Sept. 2003
Firstpage :
32
Lastpage :
36
Abstract :
One of the most important issues in current molecular biology is to build exact networks of protein-protein interactions from currently available biological knowledge and information. We describe and demonstrate the effectiveness of a method for the issue of predicting protein-protein interactions, using a stochastic model as model for combining the data of protein-protein interactions with existing knowledge of proteins. In this paper, we consider a classification of proteins as the knowledge, and in a normally available classification of proteins, a protein falls into multiple classes. Focusing on this property of protein classes, we use the class of proteins as a latent variable in the stochastic model and estimate the model parameters with both the interaction data and protein classes using time-efficient EM (Expectation-Maximization) algorithm. We evaluate the method with the experiment using actual protein-protein interactions and a classification of proteins, and experimental results have shown that the method significantly outperformed other methods tested in our experiments.
Keywords :
biology computing; data mining; molecular biophysics; optimisation; parameter estimation; proteins; stochastic processes; EM algorithm; Expectation-Maximization algorithm; data mining; heterogeneous data sets; interaction networks; molecular biology; parameter estimation; protein classification; protein-protein interaction prediction; stochastic model; Biological system modeling; Chemicals; Computational biology; Conferences; Data mining; Parameter estimation; Predictive models; Proteins; Stochastic processes; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications, 2003. Proceedings. 14th International Workshop on
ISSN :
1529-4188
Print_ISBN :
0-7695-1993-8
Type :
conf
DOI :
10.1109/DEXA.2003.1231994
Filename :
1231994
Link To Document :
بازگشت