DocumentCode :
1305283
Title :
A Max-Flow-Based Approach to the Identification of Protein Complexes Using Protein Interaction and Microarray Data
Author :
Feng, Jianxing ; Jiang, Rui ; Jiang, Tao
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Volume :
8
Issue :
3
fYear :
2011
Firstpage :
621
Lastpage :
634
Abstract :
The emergence of high-throughput technologies leads to abundant protein-protein interaction (PPI) data and microarray gene expression profiles, and provides a great opportunity for the identification of novel protein complexes using computational methods. By combining these two types of data, we propose a novel Graph Fragmentation Algorithm (GFA) for protein complex identification. Adapted from a classical max-flow algorithm for finding the (weighted) densest subgraphs, GFA first finds large (weighted) dense subgraphs in a protein-protein interaction network, and then, breaks each such subgraph into fragments iteratively by weighting its nodes appropriately in terms of their corresponding log-fold changes in the microarray data, until the fragment subgraphs are sufficiently small. Our tests on three widely used protein-protein interaction data sets and comparisons with several latest methods for protein complex identification demonstrate the strong performance of our method in predicting novel protein complexes in terms of its specificity and efficiency. Given the high specificity (or precision) that our method has achieved, we conjecture that our prediction results imply more than 200 novel protein complexes.
Keywords :
bioinformatics; data analysis; genetics; genomics; graph theory; molecular biophysics; proteins; graph fragmentation algorithm; max-flow algorithm; microarray gene expression profile; protein complex identification; protein-protein interaction; Benchmark testing; Bioinformatics; Electronics packaging; Gene expression; Prediction algorithms; Proteins; Protein complex; dense subgraph; efficient algorithm.; maximum network flow; microarray; protein-protein interaction network; Algorithms; Computational Biology; Databases, Protein; Gene Expression Profiling; Protein Array Analysis; Protein Interaction Mapping; Reproducibility of Results; Saccharomyces cerevisiae Proteins;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2010.78
Filename :
5557852
Link To Document :
بازگشت