Title :
Protein structural class prediction using support vector machine
Author :
Shafiullah, Gazi Mohammad ; Al-Mamun, Hawlader Abdullah
Author_Institution :
Dept. of Comput. Sci. & Inf. Technol., Islamic Univ. of Technol., Gazipur, Bangladesh
Abstract :
Protein structural class prediction can play a vital role in protein 3-D structure prediction by reducing the search space of 3-D structure prediction algorithms. In this paper we used support vector machine to predict protein structural class solely based of its amino acid sequences, i.e. mainly α, mainly β, α- β and fss from CATH protein structure database; all-α, all-β, α/β, α+β from SCOP protein structure database. Four different datasets were used in this paper among them two were constructed using a unique way called Representative Protein Extraction method. During the training phase for the binary classification 99.91% accuracy was achieved for fss vs. others. Also during the testing phase for SCOP database the overall prediction accuracy was 97.14% whereas for CATH database it was 96%. The results obtained in this study are quite encouraging, indicating that it can be used as a complimentary method for protein class prediction to many other existing methods.
Keywords :
biology computing; pattern classification; proteins; support vector machines; CATH protein structure database;; SCOP protein structure database; amino acid sequences; binary classification; protein 3D structure prediction; protein structural class prediction; representative protein extraction method; support vector machine; CATH database; Protein structural class; SCOP database; Support vector machine;
Conference_Titel :
Electrical and Computer Engineering (ICECE), 2010 International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-4244-6277-3
DOI :
10.1109/ICELCE.2010.5700657