DocumentCode :
633050
Title :
Protein database search optimization based on CUDA and MPI
Author :
Pavlovic, D. ; Vaser, Robert ; Korpar, Matija ; Sikic, Mile
Author_Institution :
Fac. of Electr. Eng. & Comput., Univ. of Zagreb, Zagreb, Croatia
fYear :
2013
fDate :
20-24 May 2013
Firstpage :
1278
Lastpage :
1280
Abstract :
Protein database search is an important method in the field of computational biology. There are a large number of sequences in an average database which makes such searches rather time and resource consuming. With the rapid growth in size of these databases in the past years, there came a need to speed up the search and consequently, any alignments performed on such databases.This paper presents an acceleration of the database search tool sw#DB which is based on a CUDA implementation of Smith-Waterman algorithm. We achieved speed up by reducing database size. The whole database was divided into seeds of a fixed length. The positions of these seeds and the corresponding sequence indexes from the database are then stored in a hash container. This allows for a constant time lookup of all the positions of a seed in every sequence of a database. Potential alignment candidate sequences for a query are filtered using this method, forwarding only those which contain at least one seed from the query to the sw#DB. This reduces the number of alignments performed. Overall, it brings a speedup of around three times compared to the basic sw#DB tool, based solely on Smith Waterman algorithm, with almost no loss of accuracy. The implementation is written in CUDA and C programming languages. For large queries, MPI implementation with multiple CUDA cards is used.
Keywords :
C language; application program interfaces; biology computing; information filtering; message passing; parallel algorithms; parallel architectures; proteins; query processing; CUDA implementation; MPI; computational biology; database search tool; protein database search optimization; sw#DB; Accuracy; Graphics processing units; Indexes; Optimization; Proteins; Software algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information & Communication Technology Electronics & Microelectronics (MIPRO), 2013 36th International Convention on
Conference_Location :
Opatija
Print_ISBN :
978-953-233-076-2
Type :
conf
Filename :
6596454
Link To Document :
بازگشت