Title of article :
Data mining the protein data bank: automatic detection and assignment of carbohydrate structures (Original Research Article)
Author/Authors :
Thomas Lütteke, Martin Frank and Lluis Fontbote, Claus-W von der Lieth
Issue Information :
Biweekly journal with serial issue number, year 2004
Pages :
6
From page :
1015
To page :
1020
Abstract :
Knowledge of the 3D structure of glycans is a prerequisite for a complete understanding of the biological processes in which glycoproteins are involved. However, because of a lack of standardised nomenclature, carbohydrate compounds are difficult to locate within the Protein Data Bank (PDB). Using an algorithm that requires only element types and atom coordinates to detect carbohydrate structures, we detected 1663 entries containing a total of 5647 carbohydrate chains. The majority of chains are N-glycosidically bound. Noncovalently bound ligands are also frequent, while O-glycans form a minority. About 30% of all carbohydrate-containing PDB entries contain one or more errors. The automatic assignment of carbohydrate structures in PDB entries will improve the cross-linking of glycobiology resources with genomic and proteomic data collections, which will be an important issue for the upcoming glycomics projects. By aiding the detection of erroneous annotations and structures, the algorithm might also help to increase database quality.
Keywords :
Data analysis, 3D structure database, Glycosylation, Bioinformatics, Algorithm
Journal title :
Carbohydrate Research
Serial Year :
2004
Record number :
964052