• DocumentCode
    2048443
  • Title

    A method to improve structural modeling based on conserved domain clusters

  • Author

    Zhang, Fa ; Xu, Lin ; Yuan, Bo

  • Author_Institution
    Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
  • fYear
    2006
  • fDate
    25-29 April 2006
  • Abstract
    Homology modeling requires an accurate alignment between a query sequence and its homologs with known three-dimensional (3D) information. Current structural modeling techniques largely use entire protein chains as templates, which are selected based only on their sequence alignments with the queries. Protein can be largely described as combinations of conserved domains, and already more than two-third of the known protein domains can be found in the protein data bank (PDB). We presented a method to improve structural modeling based on conserved domain clusters. First, we searched and mapped all the inter pro domains in the entire PDB, partitioned and clustered homologous domains into the domain-based template library. For each of the resulting clusters created, a multiple structural alignment was generated based only on the 3D coordinates of all the residues involved. Then we used the structural alignments as anchors to increase the alignment accuracy between a query and its templates, and consequently improve the quality of predicted structure for query protein. We implemented the method on DAWNING 4000A cluster system. The preliminary results show that our domain-based template library and the structure-anchored alignment protocol can be used for the partial prediction for a majority of known protein sequences with better qualities.
  • Keywords
    biology computing; pattern clustering; proteins; solid modelling; conserved domain cluster; domain-based template library; homology modeling; multiple structural alignment; protein chain; protein data bank; query sequence; structural modeling; structure-anchored alignment protocol; Bioinformatics; Computational modeling; Computers; Crystallography; Genomics; Libraries; Nuclear magnetic resonance; Predictive models; Proteins; Spectroscopy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
  • Print_ISBN
    1-4244-0054-6
  • Type

    conf

  • DOI
    10.1109/IPDPS.2006.1639540
  • Filename
    1639540