DocumentCode
65087
Title
Intelligent Consensus Modeling for Proline Cis-Trans Isomerization Prediction
Author
Yoo, Paul D. ; Muhaidat, Sami ; Taha, Kamal ; Bentahar, Jamal ; Shami, A.
Author_Institution
Dept. of Electr. & Comput. Eng., Khalifa Univ., Abu Dhabi, United Arab Emirates
Volume
11
Issue
1
fYear
2014
fDate
Jan.-Feb. 2014
Firstpage
26
Lastpage
32
Abstract
Proline cis-trans isomerization (CTI) plays a key role in the rate-determining steps of protein folding. Accurate prediction of proline CTI is of great importance for the understanding of protein folding, splicing, cell signaling, and transmembrane active transport in both the human body and animals. Our goal is to develop a state-of-the-art proline CTI predictor based on a biophysically motivated intelligent consensus modeling through the use of sequence information only (i.e., position specific scores generated by PSI-BLAST). The current computational proline CTI predictors reach about 70-73 percent Q2 accuracies and about 0.40 Matthew correlation coefficient (Mcc) through the use of sequence-based evolutionary information as well as predicted protein secondary structure information. However, our approach that utilizes a novel decision tree-based consensus model with a powerful randomized-metal earning technique has achieved 86.58 percent Q2 accuracy and 0.74 Mcc, on the same proline CTI data set, which is a better result than those of any existing computational proline CTI predictors reported in the literature.
Keywords
biochemistry; biology computing; biomembrane transport; decision trees; evolution (biological); evolutionary computation; isomerisation; learning (artificial intelligence); molecular biophysics; molecular configurations; Matthew correlation coefficient; PSI-BLAST; animals; biophysically motivated intelligent consensus modeling; cell signaling; current computational proline CTI predictors; human body; novel decision tree-based consensus model; position specific scores; powerful randomized-metalearning technique; predicted protein secondary structure information; proline cis-trans isomerization prediction; protein folding; rate-determining steps; sequence information; sequence-based evolutionary information; splicing; state-of-the-art proline CTI predictor; transmembrane active transport; Accuracy; Computational modeling; Data models; Error analysis; Predictive models; Proteins; Support vector machines; Proline cis-trans isomerization; ensemble methods; intelligent systems; machine-learning;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2013.132
Filename
6646170
Link To Document