Title of article :
Using stacked generalization to predict membrane protein types based on pseudo-amino acid composition
Author/Authors :
Wang، نويسنده , , Shuang-Quan and Yang، نويسنده , , Zhi-Jie and Chou، نويسنده , , Kuo-Chen، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2006
Pages :
6
From page :
941
To page :
946
Abstract :
Membrane proteins are vitally important for many biological processes and have become an attractive target for both basic research and drug design. Knowledge of membrane protein types often provides useful clues in deducing the functions of uncharacterized membrane proteins. With the unprecedented increasing of newly found protein sequences in the post-genomic era, it is highly demanded to develop an automated method for fast and accurately identifying the types of membrane proteins according to their amino acid sequences. Although quite a few identifiers have been developed in this regard through various approaches, such as covariant discriminant (CD), support vector machine (SVM), artificial neural network (ANN), and K-nearest neighbor (KNN), classifier the way they operate the identification is basically individual. As is well known, wise persons usually take into account the opinions from several experts rather than rely on only one when they are making critical decisions. Likewise, a sophisticated identifier should be trained by several different modes. In view of this, based on the frame of pseudo-amino acid that can incorporate a considerable amount of sequence-order effects, a novel approach called “stacked generalization” or “stacking” has been introduced. Unlike the “bagging” and “boosting” approaches which only combine the classifiers of a same type, the stacking approach can combine several different types of classifiers through a meta-classifier to maximize the generalization accuracy. The results thus obtained were very encouraging. It is anticipated that the stacking approach may also hold a high potential to improve the identification quality for, among many other protein attributes, subcellular location, enzyme family class, protease type, and protein–protein interaction type. The stacked generalization classifier is available as a web-server named “SG-MPt_Pred” at: http://202.120.37.186/bioinf/wangsq/service.htm.
Keywords :
Membrane protein types , stacking , Instance-based learning , Nonlinear combination , Pseudo-amino acid composition , Support vector machine
Journal title :
Journal of Theoretical Biology
Serial Year :
2006
Journal title :
Journal of Theoretical Biology
Record number :
1538073
Link To Document :
بازگشت