DocumentCode :
174838
Title :
Protein Motif Retrieval by Secondary Structure Element Geometry and Biological Features Saliency
Author :
Cantoni, Virginio ; Ferretti, Marco ; Pellicano, Nicola ; Vandoni, Jennifer ; Musci, Mirto ; Nugrahaningsih, Nahumi
Author_Institution :
Dept. of Electr., Comput. & Biomed. Eng., Univ. of Pavia, Pavia, Italy
fYear :
2014
fDate :
1-5 Sept. 2014
Firstpage :
23
Lastpage :
27
Abstract :
This paper presents an approach to detect the presence of a given motif in proteins or in protein data bank (PDB). The approach is based on the secondary structure elements (SSEs) geometrical arrangement in 3D space. A motif is represented as a set of SSEs in their specific positions related to a local reference system (LRS). We propose, exploiting the SSE biological feature saliency in the motif LRS construction stage, a planning strategy to speed-up the motif retrieval process. The experimentation has been carried out on a set of 20 proteins selected from the PDB. In detail we tested five different cases: (i) performances on searching a motif within single proteins, (ii) searching motifs on a set of proteins belonging to the same biological family, (iii) searching into single symmetric proteins, (iv) searching on a set of symmetric proteins from the same family, and finally (v) a general motif retrieval from the entire protein dataset. The experimental results showed good motif recognition performances on each test category, and, by exploiting the basic biological features saliency in motif construction, comparing to a previous approach of SSEs block geometrical retrieval based on the Generalized Hough Transform, it was revealed a significant decrease of the time/space computational complexity. It is worth to point out that the computation time for the case of motif absence is significantly lower than the case of motif present.
Keywords :
biology computing; computational complexity; molecular biophysics; molecular configurations; proteins; 3D space; SSE biological feature saliency; biological family; biological feature saliency; geometrical arrangement; local reference system; motif LRS construction stage; planning strategy; protein data bank; protein motif retrieval process; secondary structure element geometry; space computational complexity; symmetric proteins; time computational complexity; Computers; Geometry; Protein engineering; Proteins; Vectors; geometry feature; pattern recognition; protein motif retrieval; secondary structure;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2014 25th International Workshop on
Conference_Location :
Munich
ISSN :
1529-4188
Print_ISBN :
978-1-4799-5721-7
Type :
conf
DOI :
10.1109/DEXA.2014.22
Filename :
6974821
Link To Document :
بازگشت