Title :
Prediction of protein secondary structure using large margin nearest neighbor classification
Author :
Yang, Wei ; Wang, Kuanquan ; Zuo, Wangmeng
Author_Institution :
Biocomput. Res. Centre, Harbin Inst. of Technol., Harbin, China
Abstract :
Prediction of protein secondary structure from a primary sequence plays a critical role in structural biology. In this paper, we introduce a novel method for protein secondary structure prediction by using PSSM profiles and large margin nearest neighbor classification. Although the PSSM profiles and traditional nearest neighbor (NN) method can be directly used to predict secondary structure, since the PSSM profiles are not specifically designed for protein secondary structure prediction, the NN method could not achieve satisfactory prediction accuracy. To addressing this problem, we use a large margin nearest neighbor model to learn a Mahalanobis distance metric via convex semidefinite programming for nearest neighbor classification. Then, an energy-based rule is invoked to assign secondary structure. Tests show that, compared with other NN methods, significant performance improvement has been achieved with respect to prediction accuracy by the proposed method.
Keywords :
bioinformatics; convex programming; pattern classification; proteins; Mahalanobis distance metric; PSSM profiles; convex semidefinite programming; large margin nearest neighbor classification; protein secondary structure; structural biology; Fasteners; Periodic structures; Predictive models; Nearest neighbor; distance metric; large margin; protein secondary structure prediction;
Conference_Titel :
Advanced Computer Control (ICACC), 2011 3rd International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-8809-4
Electronic_ISBN :
978-1-4244-8810-0
DOI :
10.1109/ICACC.2011.6016397