DocumentCode
2530864
Title
A Geometric Representation of Protein Sequences
Author
Gu, Shengyin ; Poch, Olivier ; Hamann, Bernd ; Koehl, Patrice
Author_Institution
Univ. of California, Davis
fYear
2007
fDate
2-4 Nov. 2007
Firstpage
135
Lastpage
142
Abstract
The amino acid sequence of a protein is the key to understanding its structure and ultimately its function in the cell. This paper addresses the fundamental issue of encoding amino acids in ways that the visualization of protein sequences facilitates the decoding of its information content. We show that a feature-based representation in a three-dimensional (3D) space derived from substitution matrices provides an adequate representation from which the domain content of a protein can be predicted. In addition, we show that each dimension of the feature space can be related to a physical property of the amino acids.
Keywords
cellular biophysics; molecular biophysics; molecular configurations; proteins; amino acid encoding; amino acid sequence; cells; geometric representation; protein function; protein sequences; protein structure; protein visualization; substitution matrix; Amino acids; Bioinformatics; Biological information theory; Computer science; Data analysis; Data visualization; Encoding; Genomics; Protein sequence; USA Councils;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine, 2007. BIBM 2007. IEEE International Conference on
Conference_Location
Fremont, CA
Print_ISBN
978-0-7695-3031-4
Type
conf
DOI
10.1109/BIBM.2007.22
Filename
4413047
Link To Document