DocumentCode :
3181619
Title :
A novel approach to protein substructure matching
Author :
Vardharaj, L. ; Singh, Parampreet ; Ranjan, Sumit ; Rajagopalan, Narendran ; Mala, C.
Author_Institution :
Dept. of Comput. Sci. & Eng., Nat. Inst. of Technol.(NIT), Trichy, India
fYear :
2011
fDate :
11-14 Dec. 2011
Firstpage :
540
Lastpage :
545
Abstract :
The rapidly increasing volumes of structural data of proteins has led to need of algorithms which can rapidly predict functions for proteins based on structure. Similarity between protein structures can provide evidence of possible functional similarity. In this paper, an attempt is made to efficiently recognize similar protein structures in the protein database contain thousands of proteins. This paper gives an efficient heuristic algorithm for finding protein 3D substructures in a 3D protein structure that are similar to a given query 3D protein substructure. This algorithm can be used for searching a database of protein 3D structures. Our approach is to divide the protein structure into sub-structures of size of query structure and compare each sub structure with the query protein using Procrustes algorithm which is based on the root mean square distance between the structures. The division involves constructing a bounding box over both the query and protein structure and dividing the bigger box into sizes of the smaller box. The above algorithm is implemented in parallel using message passing interface. Experiments show that our algorithm can find similar 3D substructures in reasonable time. This paper also presents various statistics as how our algorithm performs against a sequential algorithm and how the algorithm performs with varying sizes of the query structure.
Keywords :
application program interfaces; bioinformatics; data structures; message passing; molecular biophysics; proteins; query languages; query processing; solid modelling; 3D protein substructure matching; bounding box; functional similarity; heuristic algorithm; message passing interface; protein database searching; query structure; sequential algorithm; structural data; Algorithm design and analysis; Computer science; Databases; Periodic structures; Proteins; Root mean square; Three dimensional displays; Bioinformatics; Bounding box; Correspondence and Alignment; Message passing interface(MPI); Procrustes; Proteins; Root mean square deviation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Technologies (WICT), 2011 World Congress on
Conference_Location :
Mumbai
Print_ISBN :
978-1-4673-0127-5
Type :
conf
DOI :
10.1109/WICT.2011.6141303
Filename :
6141303
Link To Document :
بازگشت