• DocumentCode
    34550
  • Title

    Randomized Subspace Learning for Proline Cis-Trans Isomerization Prediction

  • Author

    Al-Jarrah, Omar Y. ; Yoo, Paul D. ; Taha, Kamal ; Muhaidat, Sami ; Shami, Abdallah ; Zaki, Nazar

  • Author_Institution
    ECE Dept., Khalifa Univ., Abu Dhabi, United Arab Emirates
  • Volume
    12
  • Issue
    4
  • fYear
    2015
  • fDate
    July-Aug. 1 2015
  • Firstpage
    763
  • Lastpage
    769
  • Abstract
    Proline residues are common source of kinetic complications during folding. The X-Pro peptide bond is the only peptide bond for which the stability of the cis and trans conformations is comparable. The cis-trans isomerization (CTI) of X-Pro peptide bonds is a widely recognized rate-limiting factor, which can not only induces additional slow phases in protein folding but also modifies the millisecond and sub-millisecond dynamics of the protein. An accurate computational prediction of proline CTI is of great importance for the understanding of protein folding, splicing, cell signaling, and transmembrane active transport in both the human body and animals. In our earlier work, we successfully developed a biophysically motivated proline CTI predictor utilizing a novel tree-based consensus model with a powerful metalearning technique and achieved 86.58 percent Q2 accuracy and 0.74 Mcc, which is a better result than the results (70-73 percent Q2 accuracies) reported in the literature on the well-referenced benchmark dataset. In this paper, we describe experiments with novel randomized subspace learning and bootstrap seeding techniques as an extension to our earlier work, the consensus models as well as entropy-based learning methods, to obtain better accuracy through a precise and robust learning scheme for proline CTI prediction.
  • Keywords
    biology computing; biomembrane transport; entropy; isomerisation; learning (artificial intelligence); molecular biophysics; molecular configurations; proteins; Q2 accuracy; X-Pro peptide bond; accurate computational prediction; benchmark dataset; biophysically motivated proline CTI predictor; bootstrap seeding techniques; cell signaling; cis conformation stability; entropy-based learning methods; kinetic complications; metalearning technique; millisecond dynamics; proline CTI; proline cis-trans isomerization prediction; protein folding; protein splicing; randomized subspace learning; recognized rate-limiting factor; submillisecond dynamics; trans conformation stability; transmembrane active transport; tree-based consensus model; Accuracy; Bioinformatics; Computational biology; IEEE transactions; Peptides; Proteins; Support vector machines; Proline cis-trans isomerization; ensemble methods; machine learning; proline cis-trans isomerization; subspace learning;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2014.2369040
  • Filename
    6951423