DocumentCode :
166706
Title :
A hybrid OpenMP and OpenMPI approach to geometrical motif search in proteins
Author :
Ferretti, Marco ; Musci, Mirto ; Santangelo, Luigi
Author_Institution :
Dept. of Electr., Comput. & Biomed. Eng., Univ. of Pavia, Pavia, Italy
fYear :
2014
fDate :
22-26 Sept. 2014
Firstpage :
298
Lastpage :
304
Abstract :
The retrieval and identification of geometrical motifs is an important open problem in bioinformatics. In previous works we presented Cross Motif Search (CMS), a novel algorithm which is able to search for recurring geometrical patterns in the secondary structure of proteins. A single run of CMS is able to look for similarities between a pair of proteins, and can be easily extended to compare each pair of proteins in an arbitrarily large dataset. We have implemented a shared memory parallel version of CMS and analyzed its scalability, which is limited to 8 cores. So, when the number of proteins in the set increases, the execution time of the algorithm quickly becomes unmanageable and the OpenMP implementation cannot keep up by just increasing the number of cores. In this paper we present a new hybrid parallel implementation of CMS, which combines the previous OpenMP approach with OpenMPI. Experimental runs on the same small-sized server (32 cores) show that the best hybrid OpenMP-OpenMPI configuration outperforms the best OpenMP one by a factor of 13.52. This result is confirmed on a medium-sized cluster with 256 cores, that allows the processing a larger data set in reasonable times. We also show that the new design is able to achieve great efficiency and scalability, which allows us to process huge data-set of proteins up to, in theory, the entire Protein Data Bank.
Keywords :
application program interfaces; bioinformatics; proteins; public domain software; shared memory systems; Protein Data Bank; algorithm execution time; bioinformatics; cross-motif search; efficiency analysis; geometrical motif identification; geometrical motif retrieval; geometrical motif search; hybrid OpenMP-OpenMPI approach; hybrid parallel implementation; large-data set processing; medium-sized cluster; protein-pair similarities; recurring geometrical pattern search; scalability analysis; secondary protein structure; shared-memory parallel CMS; small-sized server; Algorithm design and analysis; Multicore processing; Proteins; Scalability; Servers; Testing; OpenMP; OpenMPI; geometrical motif search; hybrid parallelism; load-balancing; proteins; secondary structure;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing (CLUSTER), 2014 IEEE International Conference on
Conference_Location :
Madrid
Type :
conf
DOI :
10.1109/CLUSTER.2014.6968787
Filename :
6968787
Link To Document :
بازگشت