DocumentCode
2048443
Title
A method to improve structural modeling based on conserved domain clusters
Author
Zhang, Fa ; Xu, Lin ; Yuan, Bo
Author_Institution
Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
fYear
2006
fDate
25-29 April 2006
Abstract
Homology modeling requires an accurate alignment between a query sequence and its homologs with known three-dimensional (3D) information. Current structural modeling techniques largely use entire protein chains as templates, which are selected based only on their sequence alignments with the queries. Protein can be largely described as combinations of conserved domains, and already more than two-third of the known protein domains can be found in the protein data bank (PDB). We presented a method to improve structural modeling based on conserved domain clusters. First, we searched and mapped all the inter pro domains in the entire PDB, partitioned and clustered homologous domains into the domain-based template library. For each of the resulting clusters created, a multiple structural alignment was generated based only on the 3D coordinates of all the residues involved. Then we used the structural alignments as anchors to increase the alignment accuracy between a query and its templates, and consequently improve the quality of predicted structure for query protein. We implemented the method on DAWNING 4000A cluster system. The preliminary results show that our domain-based template library and the structure-anchored alignment protocol can be used for the partial prediction for a majority of known protein sequences with better qualities.
Keywords
biology computing; pattern clustering; proteins; solid modelling; conserved domain cluster; domain-based template library; homology modeling; multiple structural alignment; protein chain; protein data bank; query sequence; structural modeling; structure-anchored alignment protocol; Bioinformatics; Computational modeling; Computers; Crystallography; Genomics; Libraries; Nuclear magnetic resonance; Predictive models; Proteins; Spectroscopy;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
Print_ISBN
1-4244-0054-6
Type
conf
DOI
10.1109/IPDPS.2006.1639540
Filename
1639540
Link To Document