Title :
Predicting Protein Structural Class Based on Ensemble Binary Classification
Author :
Chen, Yuehui ; Li, Wei ; Cai, Nana
Author_Institution :
Sch. of Inf. Sci. & Eng., Univ. of Jinan, Jinan, China
Abstract :
With resent advances in deriving protein sequence´s features, a new feature has been proposed for the purpose of enhancing the prediction quality, which named Quasi-sequence-order. These descriptors are derived from both the Schneider-Wrede physicochemical distance matrix and the Grantham chemical distance matrix between each pair of the 20 amino acids. The feature was taken as the input of a new designed neural network to develop statistical learning models for predicting the protein structural class. The statistical model is designed to reduce four classes to six binary classifiers, and then by ensemble the six classifiers to get the final prediction. It was experiment through the rigorous jackknife cross validation text that the success rates by our method were significantly improved.
Keywords :
biology computing; data analysis; learning (artificial intelligence); matrix algebra; neural nets; pattern classification; proteins; statistical analysis; Grantham chemical distance matrix; Schneider-Wrede physicochemical distance matrix; amino acids; ensemble binary classification; jackknife cross validation text; neural network; prediction quality; protein sequence feature; protein structural class prediction; quasi-sequence-order; statistical learning model; Amino acids; Chemicals; Information science; Neural networks; Predictive models; Protein engineering; Protein sequence; Sequences; Support vector machine classification; Support vector machines; Grantham chemical distance matrix; Protein Structure Prediction; Schneider-Wrede physicochemical distance matrix;
Conference_Titel :
Natural Computation, 2009. ICNC '09. Fifth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3736-8
DOI :
10.1109/ICNC.2009.399