• DocumentCode
    3532958
  • Title

    An integrated database for complex protein structure modeling

  • Author

    Wang, Qiang ; Dunbrack, Roland L., Jr.

  • Author_Institution
    Fox Chase Cancer Center, Philadelphia, PA
  • fYear
    2008
  • fDate
    3-5 Nov. 2008
  • Firstpage
    33
  • Lastpage
    40
  • Abstract
    In homology modeling of protein structures, it is typical to find templates through a sequence search against a database of proteins with known structures. In more complicated modeling cases, such as modeling a protein structure in contact with a ligand, sequence information itself may not be enough and more biological information is required for a successful modeling process. SCOP and PFAM are two databases providing protein domain information which can be utilized in complex protein structure modeling. However, due to the manuallycurated nature of both databases, they fail to provide timely coverage of protein sequences existing in the Protein Data Bank (PDB). In this paper, we introduce a new relational database, IDOPS, which integrates sequence and biological information extracted from remediated PDB files and protein domain information generated with HMM profiles of PFAM families. With a carefully designed protocol, this database is updated regularly and the coverage rate of PDB entries is guaranteed to be high.
  • Keywords
    information retrieval; macromolecules; medical information systems; proteins; relational databases; PFAM; Protein Data Bank; SCOP; complex protein structure modeling; protein domain information; relational database; remediated PDB files; sequence information; sequence search; Biological system modeling; Cancer; Data mining; Hidden Markov models; Predictive models; Proteins; Protocols; Relational databases; Sequences; Spine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomeidcine Workshops, 2008. BIBMW 2008. IEEE International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    978-1-4244-2890-8
  • Type

    conf

  • DOI
    10.1109/BIBMW.2008.4686206
  • Filename
    4686206