• DocumentCode
    3394449
  • Title

    Reassessing the limit of data integration for the prediction of protein-protein interactions in Saccharomyces cerevisiae

  • Author

    Browne, Fiona ; Wang, Haiying ; Zheng, Huiru ; Azuaje, Francisco

  • Author_Institution
    Sch. of Comput. & Math., Univ. of Ulster, Belfast
  • fYear
    2008
  • fDate
    15-17 Sept. 2008
  • Firstpage
    128
  • Lastpage
    135
  • Abstract
    This paper investigates the integration of functional genomic data for the prediction of protein-protein interactions (PPI) in Saccharomyces cerevisiae. A previous benchmark study observed a marginal increase in predictive power when integrating diverse features. Classification performance was evaluated using the Receiver Operating Characteristic (ROC) curve. In this study we propose the implementation of a likelihood ratio based Bayesian classifier to reassess the limits of genomic integration. The classifier combines seven genomic features ranging from co-expression to essentiality. Due to the imbalance of the dataset in this study, ROC curves may present an overly optimistic view of the classification performance. We use the true positive/false positive (TP/FP) rate and sensitivity as comparative predictive measures to the ROC curve. Predicted interactions are verified using a Gold Standard constructed from the Munich Database of Interacting Proteins Complex Catalogue. Using the measures TP/FP and sensitivity, a clear increase in classification performance was observed with the integration of features. This framework could be extended to the analysis of PPI in more complex organisms such as Drosophila melanogaster and Homo sapiens.
  • Keywords
    belief networks; biology computing; genomics; pattern classification; proteins; Bayesian classifier; Drosophila melanogaster; Homo sapiens; Munich database; Saccharomyces cerevisiae; functional genomic data; interacting protein complex catalogue; protein-protein interactions; receiver operating characteristic curve; Bayesian methods; Bioinformatics; Biology computing; Computer networks; Genomics; Large-scale systems; Proteins; Radio frequency; Sensitivity; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Bioinformatics and Computational Biology, 2008. CIBCB '08. IEEE Symposium on
  • Conference_Location
    Sun Valley, ID
  • Print_ISBN
    978-1-4244-1778-0
  • Electronic_ISBN
    978-1-4244-1779-7
  • Type

    conf

  • DOI
    10.1109/CIBCB.2008.4675769
  • Filename
    4675769