Title of article :
Polypeptide sequence property relationships in Escherichia coli based on auto cross covariances
Author/Authors :
Sjöström, Michael and Rännar, Stefan and Wieslander, Åke
Issue Information :
Biannual journal with consecutive issue number, 1995
Pages :
11
From page :
295
To page :
305
Abstract :
For multivariate classification and quantitative structure-activity studies of proteins, which involve amino acid sequences of different lengths, preprocessing methods are needed that translate each sequence into a quantitative description with the same number of variables. Three different preprocessing methods are investigated. Two of the methods are variants of auto cross covariances calculated from a multipositional description of the protein sequence. For the multipositional description, three orthogonal scales were used which describe the amino acids physico-chemically. The third method is a quantification of each sequence by a diamino acid frequency histogram. The methods are investigated by a classification of 106 Escherichia coli and Gram-negative bacteria proteins. The proteins were divided into four classes depending on their location in the cell: cytoplasm, inner membrane, periplasm and outer membrane. For the subsequent classification, PLS discriminant analysis was used. Results showed that one of the variants of auto cross covariances and the diamino acid frequency histogram representation contained much information related to the given classification problem. Hence the amino acid sequences of proteins with different final locations in Escherichia coli have significant features related to protein structure and location.
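The two kinds of preprocessing named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives neither the exact auto cross covariance formula nor the normalization that separates the two variants, so the version below assumes one common formulation (the lagged product sum divided by the number of terms). The function names and the toy descriptor values are hypothetical, and the toy values are not the three physico-chemical scales used in the paper. The point is only that sequences of different lengths map to vectors with the same number of variables.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_histogram(seq):
    """Relative frequencies of the 400 adjacent residue pairs (hypothetical helper)."""
    pairs = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]
    counts = dict.fromkeys(pairs, 0)
    for a, b in zip(seq, seq[1:]):
        if a + b in counts:  # pairs containing non-standard letters are skipped
            counts[a + b] += 1
    total = sum(counts.values())
    return [counts[p] / total if total else 0.0 for p in pairs]


def auto_cross_covariances(seq, scales, max_lag):
    """Assumed formulation: for each lag and each ordered pair of scales (j, k),
    average scale j at position i times scale k at position i + lag.

    `scales` maps a residue letter to a tuple of descriptor values.
    The output length is max_lag * n_scales**2, independent of len(seq).
    """
    rows = [scales[r] for r in seq]
    n = len(rows)
    if max_lag >= n:
        raise ValueError("max_lag must be smaller than the sequence length")
    n_scales = len(rows[0])
    out = []
    for lag in range(1, max_lag + 1):
        for j in range(n_scales):
            for k in range(n_scales):
                s = sum(rows[i][j] * rows[i + lag][k] for i in range(n - lag))
                out.append(s / (n - lag))
    return out


# Made-up two-value descriptors for two residues, for illustration only.
TOY_SCALES = {"A": (1.0, 0.0), "C": (0.0, 1.0)}

short = auto_cross_covariances("ACAC", TOY_SCALES, max_lag=2)
longer = auto_cross_covariances("ACCACAAC", TOY_SCALES, max_lag=2)
print(len(short), len(longer))            # same number of variables for both
print(len(dipeptide_histogram("ACAC")))   # always 400
```

With three scales, as in the abstract, each lag would contribute nine terms (three auto covariances and six cross covariances), so the vector length depends only on the chosen maximum lag.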
Keywords :
Peptide sequences , Partial least squares discriminant analysis , Protein classification , Sequence analysis , Auto cross covariances , Multivariate data analysis
Journal title :
Chemometrics and Intelligent Laboratory Systems
Serial Year :
1995
Record number :
1459419