Title :
Improved Prediction of Relative Solvent Accessibility Using Two-stage Support Vector Regression
Author :
Chen, Ke ; Kurgan, Michal ; Kurgan, Lukasz
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Alberta, Edmonton, AB
Abstract :
Predicted relative solvent accessibility (RSA) provides useful information for prediction of binding sites and reconstruction of the 3D-structure based on a protein sequence, which are at the very core of proteomics. Several RSA prediction methods including those that generate real values and those that predict discrete states (buried vs. exposed) have been published. We propose a novel method for real valued prediction that aims to improve the prediction quality when compared with the existing methods. The proposed method combines Support Vector Regression (SVR) predictors into a two-stage architecture. The improved prediction quality comes from a composite sequence representation, which includes a custom-selected subset of features from the PSTBLAST profile, secondary structure predicted with PSTPRED, and binary code that indicates position of a given residue with respect to sequence termini. Based on empirical evaluation with a standard benchmark dataset, the proposed method obtains the mean absolute error (MAE) equal 0.143, which corresponds to 6% error rate reduction when compared with the best performing competing method that obtains 0.152 MAE on this dataset.
Keywords :
binary codes; biology computing; molecular biophysics; molecular configurations; proteins; support vector machines; 3D-structure reconstruction; PSI-BLAST profile; PSI-PRED profile; benchmark dataset; binary code; binding sites; mean absolute error; protein sequence; proteomics; relative solvent accessibility prediction; secondary protein structure; two-stage architecture; two-stage support vector regression; Binary codes; Error analysis; Linear regression; Neural networks; Prediction methods; Protein sequence; Proteomics; Solvents; Support vector machine classification; Support vector machines;
Conference_Titel :
Bioinformatics and Biomedical Engineering, 2007. ICBBE 2007. The 1st International Conference on
Conference_Location :
Wuhan
Print_ISBN :
1-4244-1120-3
DOI :
10.1109/ICBBE.2007.13