DocumentCode :
65087
Title :
Intelligent Consensus Modeling for Proline Cis-Trans Isomerization Prediction
Author :
Yoo, Paul D. ; Muhaidat, Sami ; Taha, Kamal ; Bentahar, Jamal ; Shami, A.
Author_Institution :
Dept. of Electr. & Comput. Eng., Khalifa Univ., Abu Dhabi, United Arab Emirates
Volume :
11
Issue :
1
fYear :
2014
fDate :
Jan.-Feb. 2014
Firstpage :
26
Lastpage :
32
Abstract :
Proline cis-trans isomerization (CTI) plays a key role in the rate-determining steps of protein folding. Accurate prediction of proline CTI is of great importance for the understanding of protein folding, splicing, cell signaling, and transmembrane active transport in both the human body and animals. Our goal is to develop a state-of-the-art proline CTI predictor based on a biophysically motivated intelligent consensus modeling through the use of sequence information only (i.e., position specific scores generated by PSI-BLAST). The current computational proline CTI predictors reach about 70-73 percent Q2 accuracies and about 0.40 Matthew correlation coefficient (Mcc) through the use of sequence-based evolutionary information as well as predicted protein secondary structure information. However, our approach that utilizes a novel decision tree-based consensus model with a powerful randomized-metal earning technique has achieved 86.58 percent Q2 accuracy and 0.74 Mcc, on the same proline CTI data set, which is a better result than those of any existing computational proline CTI predictors reported in the literature.
Keywords :
biochemistry; biology computing; biomembrane transport; decision trees; evolution (biological); evolutionary computation; isomerisation; learning (artificial intelligence); molecular biophysics; molecular configurations; Matthew correlation coefficient; PSI-BLAST; animals; biophysically motivated intelligent consensus modeling; cell signaling; current computational proline CTI predictors; human body; novel decision tree-based consensus model; position specific scores; powerful randomized-metalearning technique; predicted protein secondary structure information; proline cis-trans isomerization prediction; protein folding; rate-determining steps; sequence information; sequence-based evolutionary information; splicing; state-of-the-art proline CTI predictor; transmembrane active transport; Accuracy; Computational modeling; Data models; Error analysis; Predictive models; Proteins; Support vector machines; Proline cis-trans isomerization; ensemble methods; intelligent systems; machine-learning;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2013.132
Filename :
6646170
Link To Document :
بازگشت