DocumentCode :
2564259
Title :
Two-stage support vector machines to protein relative solvent accessibility prediction
Author :
Nguyen, Minh N. ; Rajapakse, Jagath C.
Author_Institution :
Bioinformatics Res. Centre, Nanyang Technol. Univ., Singapore
fYear :
2004
fDate :
7-8 Oct. 2004
Firstpage :
67
Lastpage :
72
Abstract :
Bioinformatics techniques to relative solvent accessibility (RSA) prediction are mostly single-stage approaches; they predict solvent accessibility of proteins by taking into account only the information available in amino acid sequences. We propose to use support vector machines (SVMs) as a second stage following the existing single-stage approaches for RSA prediction problem to improve the accuracy. The purpose of the second stage is to capture the contextual relationship of solvent accessibility elements in a neighborhood in determining the solvent accessibility at a particular site. We demonstrate our approach by introducing SVMs to the output of single-stage SVM classifier. Two-stage SVM approach achieves accuracies up to 90.4% and 90.2% on the Manesh dataset of 215 protein structures and the RS126 dataset of 126 nonhomologous globular proteins, respectively, which are better than the highest reported scores on both datasets to date.
Keywords :
biology computing; molecular biophysics; proteins; support vector machines; PSI-BLAST; RS126 dataset; SVM; amino acid sequences; bioinformatics techniques; nonhomologous globular protein; protein structure; relative solvent accessibility prediction; two-stage support vector machines; Amino acids; Bayesian methods; Biochemistry; Chromium; Neural networks; Organisms; Proteins; Solvents; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology, 2004. CIBCB '04. Proceedings of the 2004 IEEE Symposium on
Print_ISBN :
0-7803-8728-7
Type :
conf
DOI :
10.1109/CIBCB.2004.1393934
Filename :
1393934
Link To Document :
بازگشت