DocumentCode :
1080256
Title :
Three-Dimensional Shape-Structure Comparison Method for Protein Classification
Author :
Daras, P. ; Zarpalas, D. ; Axenopoulos, A. ; Tzovaras, D. ; Strintzis, M.G.
Author_Institution :
Informatics & Telematics institute, Thessaloniki
Volume :
3
Issue :
3
fYear :
2006
Firstpage :
193
Lastpage :
207
Abstract :
In this paper, a 3D shape-based approach is presented for the efficient search, retrieval, and classification of protein molecules. The method relies primarily on the geometric 3D structure of the proteins, which is produced from the corresponding PDB files and secondarily on their primary and secondary structure. After proper positioning of the 3D structures, in terms of translation and scaling, the spherical trace transform is applied to them so as to produce geometry-based descriptor vectors, which are completely rotation invariant and perfectly describe their 3D shape. Additionally, characteristic attributes of the primary and secondary structure of the protein molecules are extracted, forming attribute-based descriptor vectors. The descriptor vectors are weighted and an integrated descriptor vector is produced. Three classification methods are tested. A part of the FSSP/DALI database, which provides a structural classification of the proteins, is used as the ground truth in order to evaluate the classification accuracy of the proposed method. The experimental results show that the proposed method achieves more than 99 percent classification accuracy while remaining much simpler and faster than the DALI method
Keywords :
biology computing; information retrieval; molecular configurations; proteins; FSSP/DALI database; geometric 3D protein structure; geometry-based descriptor vectors; information retrieval; integrated descriptor vector; primary structure; protein classification; secondary structure; spherical trace transform; structural classification; three-dimensional shape-structure comparison method; Chemicals; Crystallography; Databases; Evolution (biology); Information retrieval; Laboratories; Proteins; Shape; Space technology; Testing; Information search and retrieval; classification; protein databases.; Algorithms; Amino Acid Sequence; Computer Simulation; Databases, Protein; Information Storage and Retrieval; Models, Chemical; Models, Molecular; Molecular Sequence Data; Protein Conformation; Proteins; Sequence Alignment; Sequence Analysis, Protein; Structure-Activity Relationship;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2006.43
Filename :
1668019
Link To Document :
بازگشت