Title :
Disulfide Bonding Pattern Prediction Server Based on Normalized Pair Distance by MODELLER
Author :
Lin, Hsuan-Hung ; Hsu, Jiin-Chyr ; Chen, Yung-Fu
Author_Institution :
Dept. of Manage. Inf. Syst., Central Taiwan Univ. of Sci. & Technol., Taichung, Taiwan
Abstract :
Prediction of the protein structure is one of the most important problems in the computational biology, and it remains one of the biggest challenges in the structural biology. Disulfide bonds play an import structural role in stabilizing protein conformations. For the protein-folding prediction, a correct prediction of disulfide bridges can greatly reduce the search space. The prediction of disulfide bonding pattern helps, to a certain degree, predicts the 3D structure of a protein and hence its function since disulfide bonds imposes geometrical constraints on the protein backbones. Then, the protein 3D structure related features called normalized pair distance (NPD) vector were imposed as the features for designing the classifier based on the support vector machine (SVM). The classifier was trained to compute the connectivity probabilities of cysteine pairs. In addition, a genetic algorithm was integrated with the SVM model to tune the parameters of the SVM and the window sizes for the features. The maximum weighted perfect matching algorithm was then used to find the disulfide connectivity pattern. In this study, the experimental results show that the accuracies rate reaches 91.7% for the prediction of the overall disulfide connectivity pattern (QP) and that of disulfide bridge prediction (QC) is 94.2% for dataset SP39.
Keywords :
biology computing; genetic algorithms; pattern classification; support vector machines; MODELLER; NPD; SVM; computational biology; connectivity probabilities; cysteine pairs; disulfide bonding pattern prediction server; disulfide bonds; disulfide bridges; disulfide connectivity pattern; genetic algorithm; geometrical constraints; maximum weighted perfect matching algorithm; normalized pair distance; protein backbones; protein conformations; protein folding prediction; protein structure; search space; structural biology; support vector machine; Accuracy; Bonding; Bridges; Genetic algorithms; Neural networks; Proteins; Support vector machines; disulfide bonding pattern; genetic algorithm; support vector machine;
Conference_Titel :
Computer, Consumer and Control (IS3C), 2012 International Symposium on
Conference_Location :
Taichung
Print_ISBN :
978-1-4673-0767-3
DOI :
10.1109/IS3C.2012.152