Title :
Predicting Hepatitis C Virus Protease Cleavage Sites Using Generalized Linear Indicator Regression Models
Author_Institution :
Dept. of Comput. Sci., Exeter Univ.
Abstract :
This paper discusses how to predict hepatitis C virus protease cleavage sites in proteins using generalized linear indicator regression models. The mutual information is used for model-size optimization. Two simulation strategies are adopted, i.e., building a model based on published peptides and building a model based on the published peptides plus newly collected sequences. It is found that the latter outperforms the former significantly. The simulation also shows that the generalized linear indicator regression model far outperforms the multilayer perceptron model
Keywords :
enzymes; medical computing; microorganisms; molecular biophysics; optimisation; physiological models; regression analysis; generalized linear indicator regression models; hepatitis C virus protease cleavage site prediction; model-size optimization; multilayer perceptron; peptides; proteins; Amino acids; Drugs; Inhibitors; Liver diseases; Multilayer perceptrons; Mutual information; Peptides; Predictive models; Proteins; Sequences; Cleavage site prediction; generalized linear indicator regression models; hepatitis C virus; Algorithms; Amino Acid Sequence; Binding Sites; Enzyme Activation; HIV Protease; Hepacivirus; Hydrolysis; Linear Models; Models, Chemical; Models, Molecular; Molecular Sequence Data; Protein Binding; Regression Analysis; Sequence Analysis, Protein; Substrate Specificity;
Journal_Title :
Biomedical Engineering, IEEE Transactions on
DOI :
10.1109/TBME.2006.881779