• DocumentCode
    3460269
  • Title

    Biological Features for Sequence-Based Prediction of Protein Stability Changes upon Amino Acid Substitutions

  • Author

    Teng, Shaolei ; Srivastava, Anand K. ; Wang, Liangjiang

  • Author_Institution
    Dept. of Genetics & Biochem., Clemson Univ., Clemson, SC, USA
  • fYear
    2009
  • fDate
    3-5 Aug. 2009
  • Firstpage
    201
  • Lastpage
    206
  • Abstract
    Protein destabilization is a common mechanism by which amino acid substitutions cause human diseases. In this study, a new machine learning method has been developed for sequence-based prediction of protein stability changes upon single amino acid substitutions. Support vector machines were trained with data from experimental studies on the free energy change of protein stability upon mutations. To construct accurate classifiers, twenty biological features were examined for input vector encoding. It was shown that classifier performance varied significantly by the use of different features. The most accurate classifier was constructed using a combination of several biological features. This classifier achieved an overall accuracy of 82.24% with 75.24% sensitivity and 85.36% specificity. Predictive results at this level of accuracy may be used in human genetic studies to distinguish between deleterious and tolerant alterations in disease candidate genes.
  • Keywords
    diseases; encoding; learning (artificial intelligence); medical computing; pattern classification; proteins; support vector machines; amino acid substitution; biological feature; human disease; human genetic studies; machine learning method; pattern classifier; protein destabilization; protein stability change prediction; sequence-based prediction; support vector machines; tolerant gene alteration; vector encoding; Amino acids; Biological information theory; Diseases; Genetic mutations; Humans; Learning systems; Protein engineering; Stability; Support vector machine classification; Support vector machines; biological feature selection; machine learning; protein stabiligy prediction; support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics, Systems Biology and Intelligent Computing, 2009. IJCBS '09. International Joint Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3739-9
  • Type

    conf

  • DOI
    10.1109/IJCBS.2009.101
  • Filename
    5260696