Title of article :
Amino Acid Encoding Schemes from Protein Structure Alignments: Multi-dimensional Vectors to Describe Residue Types
Author/Authors :
LIN، نويسنده , , KUANG and MAY، نويسنده , , ALEX C.W. and TAYLOR، نويسنده , , WILLIAM R.، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2002
Pages :
5
From page :
361
To page :
365
Abstract :
Bioinformatic software has used various numerical encoding schemes to describe amino acid sequences. Orthogonal encoding, employing 20 numbers to describe the amino acid type of one protein residue, is often used with artificial neural network (ANN) models. However, this can increase the model complexity, thus leading to difficulty in implementation and poor performance. Here, we use ANNs to derive encoding schemes for the amino acid types from protein three-dimensional structure alignments. Each of the 20 amino acid types is characterized with a few real numbers. Our schemes are tested on the simulation of amino acid substitution matrices. These simplified schemes outperform the orthogonal encoding on small data sets. Using one of these encoding schemes, we generate a colouring scheme for the amino acids in which comparable amino acids are in similar colours. We expect it to be useful for visual inspection and manual editing of protein multiple sequence alignments.
Journal title :
Journal of Theoretical Biology
Serial Year :
2002
Journal title :
Journal of Theoretical Biology
Record number :
1535300
Link To Document :
بازگشت