DocumentCode
3394449
Title
Reassessing the limit of data integration for the prediction of protein-protein interactions in Saccharomyces cerevisiae
Author
Browne, Fiona ; Wang, Haiying ; Zheng, Huiru ; Azuaje, Francisco
Author_Institution
Sch. of Comput. & Math., Univ. of Ulster, Belfast
fYear
2008
fDate
15-17 Sept. 2008
Firstpage
128
Lastpage
135
Abstract
This paper investigates the integration of functional genomic data for the prediction of protein-protein interactions (PPI) in Saccharomyces cerevisiae. A previous benchmark study observed a marginal increase in predictive power when integrating diverse features. Classification performance was evaluated using the Receiver Operating Characteristic (ROC) curve. In this study we propose the implementation of a likelihood ratio based Bayesian classifier to reassess the limits of genomic integration. The classifier combines seven genomic features ranging from co-expression to essentiality. Due to the imbalance of the dataset in this study, ROC curves may present an overly optimistic view of the classification performance. We use the true positive/false positive (TP/FP) rate and sensitivity as comparative predictive measures to the ROC curve. Predicted interactions are verified using a Gold Standard constructed from the Munich Database of Interacting Proteins Complex Catalogue. Using the measures TP/FP and sensitivity, a clear increase in classification performance was observed with the integration of features. This framework could be extended to the analysis of PPI in more complex organisms such as Drosophila melanogaster and Homo sapiens.
Keywords
belief networks; biology computing; genomics; pattern classification; proteins; Bayesian classifier; Drosophila melanogaster; Homo sapiens; Munich database; Saccharomyces cerevisiae; functional genomic data; interacting protein complex catalogue; protein-protein interactions; receiver operating characteristic curve; Bayesian methods; Bioinformatics; Biology computing; Computer networks; Genomics; Large-scale systems; Proteins; Radio frequency; Sensitivity; Spatial databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence in Bioinformatics and Computational Biology, 2008. CIBCB '08. IEEE Symposium on
Conference_Location
Sun Valley, ID
Print_ISBN
978-1-4244-1778-0
Electronic_ISBN
978-1-4244-1779-7
Type
conf
DOI
10.1109/CIBCB.2008.4675769
Filename
4675769
Link To Document