• DocumentCode
    65087
  • Title

    Intelligent Consensus Modeling for Proline Cis-Trans Isomerization Prediction

  • Author

    Yoo, Paul D. ; Muhaidat, Sami ; Taha, Kamal ; Bentahar, Jamal ; Shami, A.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Khalifa Univ., Abu Dhabi, United Arab Emirates
  • Volume
    11
  • Issue
    1
  • fYear
    2014
  • fDate
    Jan.-Feb. 2014
  • Firstpage
    26
  • Lastpage
    32
  • Abstract
    Proline cis-trans isomerization (CTI) plays a key role in the rate-determining steps of protein folding. Accurate prediction of proline CTI is of great importance for the understanding of protein folding, splicing, cell signaling, and transmembrane active transport in both the human body and animals. Our goal is to develop a state-of-the-art proline CTI predictor based on a biophysically motivated intelligent consensus modeling through the use of sequence information only (i.e., position specific scores generated by PSI-BLAST). The current computational proline CTI predictors reach about 70-73 percent Q2 accuracies and about 0.40 Matthew correlation coefficient (Mcc) through the use of sequence-based evolutionary information as well as predicted protein secondary structure information. However, our approach that utilizes a novel decision tree-based consensus model with a powerful randomized-metal earning technique has achieved 86.58 percent Q2 accuracy and 0.74 Mcc, on the same proline CTI data set, which is a better result than those of any existing computational proline CTI predictors reported in the literature.
  • Keywords
    biochemistry; biology computing; biomembrane transport; decision trees; evolution (biological); evolutionary computation; isomerisation; learning (artificial intelligence); molecular biophysics; molecular configurations; Matthew correlation coefficient; PSI-BLAST; animals; biophysically motivated intelligent consensus modeling; cell signaling; current computational proline CTI predictors; human body; novel decision tree-based consensus model; position specific scores; powerful randomized-metalearning technique; predicted protein secondary structure information; proline cis-trans isomerization prediction; protein folding; rate-determining steps; sequence information; sequence-based evolutionary information; splicing; state-of-the-art proline CTI predictor; transmembrane active transport; Accuracy; Computational modeling; Data models; Error analysis; Predictive models; Proteins; Support vector machines; Proline cis-trans isomerization; ensemble methods; intelligent systems; machine-learning;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2013.132
  • Filename
    6646170