DocumentCode
853227
Title
Inferring Boundary Information of Discontinuous-Domain Proteins
Author
Sikder, Abdur R. ; Zomaya, Albert Y.
Author_Institution
Int. Comput. Sci. Inst., Univ. of California, Berkeley, CA
Volume
7
Issue
3
fYear
2008
Firstpage
200
Lastpage
205
Abstract
Wetlaufer introduced the classification of domains into continuous and discontinuous. Continuous domains form from a single-chain segment and discontinuous domains are composed of two or more chain segments. Richardson identified approximately 100 domains in her review. Her assignment was based on the concepts that the domain would be independently stable and/or could undergo rigid-body-like movements with respect to the entire protein. There are now several instances where structurally similar domains occur in different proteins in the absence of noticeable sequence similarity. Possibly, the most notable of such domains is the trios-phosphate isomerase (TIM) barrel. With the increase in the number of known sequences, computer algorithms are required to identify the discontinuous domain of an unknown protein chain in order to determine its structure and function. We have developed a novel algorithm for discontinuous-domain boundary prediction based on a machine learning algorithm and interresidue contact interactions values. We have used 415 proteins, including 100 discontinuous-domain chains for training. There is no method available that is designed solely on a sequence based for the prediction of discontinuous domain. Domain discovery performed significantly well compared to the structure-based methods like structural classification of proteins (SCOP), class, architecture, topology and homologous superfamily (CATH), and domain maker (DOMAK).
Keywords
biology computing; learning (artificial intelligence); proteins; class, architecture, topology and homologous superfamily; discontinuous-domain proteins; domain maker; interresidue contact interactions; machine learning algorithm; sequence similarity; structural classification of proteins; trios-phosphate isomerase barrel; Discontinuous domain; domain boundary; machine learning; protein domain; Algorithms; Amino Acid Sequence; Artificial Intelligence; Information Storage and Retrieval; Molecular Sequence Data; Pattern Recognition, Automated; Protein Structure, Tertiary; Proteins; Sequence Alignment; Sequence Analysis, Protein;
fLanguage
English
Journal_Title
NanoBioscience, IEEE Transactions on
Publisher
ieee
ISSN
1536-1241
Type
jour
DOI
10.1109/TNB.2008.2002283
Filename
4618629
Link To Document