DocumentCode
3532958
Title
An integrated database for complex protein structure modeling
Author
Wang, Qiang ; Dunbrack, Roland L., Jr.
Author_Institution
Fox Chase Cancer Center, Philadelphia, PA
fYear
2008
fDate
3-5 Nov. 2008
Firstpage
33
Lastpage
40
Abstract
In homology modeling of protein structures, it is typical to find templates through a sequence search against a database of proteins with known structures. In more complicated modeling cases, such as modeling a protein structure in contact with a ligand, sequence information itself may not be enough and more biological information is required for a successful modeling process. SCOP and PFAM are two databases providing protein domain information which can be utilized in complex protein structure modeling. However, due to the manuallycurated nature of both databases, they fail to provide timely coverage of protein sequences existing in the Protein Data Bank (PDB). In this paper, we introduce a new relational database, IDOPS, which integrates sequence and biological information extracted from remediated PDB files and protein domain information generated with HMM profiles of PFAM families. With a carefully designed protocol, this database is updated regularly and the coverage rate of PDB entries is guaranteed to be high.
Keywords
information retrieval; macromolecules; medical information systems; proteins; relational databases; PFAM; Protein Data Bank; SCOP; complex protein structure modeling; protein domain information; relational database; remediated PDB files; sequence information; sequence search; Biological system modeling; Cancer; Data mining; Hidden Markov models; Predictive models; Proteins; Protocols; Relational databases; Sequences; Spine;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomeidcine Workshops, 2008. BIBMW 2008. IEEE International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
978-1-4244-2890-8
Type
conf
DOI
10.1109/BIBMW.2008.4686206
Filename
4686206
Link To Document