DocumentCode
3460269
Title
Biological Features for Sequence-Based Prediction of Protein Stability Changes upon Amino Acid Substitutions
Author
Teng, Shaolei ; Srivastava, Anand K. ; Wang, Liangjiang
Author_Institution
Dept. of Genetics & Biochem., Clemson Univ., Clemson, SC, USA
fYear
2009
fDate
3-5 Aug. 2009
Firstpage
201
Lastpage
206
Abstract
Protein destabilization is a common mechanism by which amino acid substitutions cause human diseases. In this study, a new machine learning method has been developed for sequence-based prediction of protein stability changes upon single amino acid substitutions. Support vector machines were trained with data from experimental studies on the free energy change of protein stability upon mutations. To construct accurate classifiers, twenty biological features were examined for input vector encoding. It was shown that classifier performance varied significantly by the use of different features. The most accurate classifier was constructed using a combination of several biological features. This classifier achieved an overall accuracy of 82.24% with 75.24% sensitivity and 85.36% specificity. Predictive results at this level of accuracy may be used in human genetic studies to distinguish between deleterious and tolerant alterations in disease candidate genes.
Keywords
diseases; encoding; learning (artificial intelligence); medical computing; pattern classification; proteins; support vector machines; amino acid substitution; biological feature; human disease; human genetic studies; machine learning method; pattern classifier; protein destabilization; protein stability change prediction; sequence-based prediction; support vector machines; tolerant gene alteration; vector encoding; Amino acids; Biological information theory; Diseases; Genetic mutations; Humans; Learning systems; Protein engineering; Stability; Support vector machine classification; Support vector machines; biological feature selection; machine learning; protein stabiligy prediction; support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics, Systems Biology and Intelligent Computing, 2009. IJCBS '09. International Joint Conference on
Conference_Location
Shanghai
Print_ISBN
978-0-7695-3739-9
Type
conf
DOI
10.1109/IJCBS.2009.101
Filename
5260696
Link To Document