Title :
A two-step approach for clustering proteins based on protein interaction profile
Author :
Pengjun Pe ; Zhang, Aidong
Author_Institution :
Dept. of Comput. Sci. & Eng., State Univ. of New York, USA
Abstract :
High-throughput methods for detecting protein-protein interactions (PPI) have given researchers an initial global picture of protein interactions on a genomic scale. The huge data sets generated by such experiments pose new challenges in data analysis. Though clustering methods have been successfully applied in many areas in bioinformatics many clustering algorithms cannot be readily applied on protein interaction data sets. One main problem is that the similarity between two proteins cannot be easily defined. This paper proposes a probabilistic model to define the similarity based on conditional probabilities. We then propose a two-step method for estimating the similarity between two proteins based on protein interaction profile. In the first step, the model is trained with proteins with known annotation. Based on this model, similarities are calculated in the second step. Experiments show that our method improves performance.
Keywords :
biology computing; genetics; molecular biophysics; physiological models; proteins; statistical analysis; bioinformatics; conditional probabilities; genomics; probabilistic model; protein clustering; protein interaction profile; two-step approach; Bioinformatics; Biological processes; Clustering algorithms; Clustering methods; Computer science; Current measurement; Data analysis; Genomics; Predictive models; Protein engineering;
Conference_Titel :
Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
Print_ISBN :
0-7695-2476-1
DOI :
10.1109/BIBE.2005.10