DocumentCode :
3198310
Title :
A parameterisable and scalable Smith-Waterman algorithm implementation on CUDA-compatible GPUs
Author :
Ling, Cheng ; Benkrid, Khaled ; Hamada, Tsuyoshi
Author_Institution :
Inst. for Integrated Micro & Nano Syst., Univ. of Edinburgh, Edinburgh, UK
fYear :
2009
fDate :
27-28 July 2009
Firstpage :
94
Lastpage :
100
Abstract :
This paper describes a multi-threaded parallel design and implementation of the Smith-Waterman (SM) algorithm on compute unified device architecture (CUDA)-compatible graphic processing units (GPUs). A novel technique has been put forward to solve the restriction on the length of the query sequence in previous GPU implementations of the Smith-Waterman algorithm. The main reasons behind this limitation in previous GPU implementations were the finite size of local memory and number of threads per block. Our solution to this problem uses a divide and conquer approach to compute the alignment matrix involved in each pairwise sequence alignment, as it divides the entire matrix computation into multiple sub-matrices and allocates the available amount of threads and memory resources to each sub-matrix iteratively. Intermediate data is stored in shared and global memory on the fly depending on the length of sequences in hand. The proposed technique resulted in up to 4.2 GCUPS (Giga Cell Updates per Second) performance when tested against the SWISS-PROT protein database, which is up to 15 times faster than a equivalent optimised CPU-only implementation running on a Pentium 4 3.4 GHz desktop computer. Moreover, our implementation can cope with any query or subject sequence size, unlike previously reported GPU implementations of the Smith-Waterman algorithm which makes it fully deployable in real world bioinformatics applications.
Keywords :
bioinformatics; computer graphic equipment; computer graphics; iterative methods; matrix algebra; multi-threading; parallel algorithms; parallel architectures; sequences; storage allocation; CUDA-compatible GPU; alignment matrix; bioinformatics; compute unified device architecture; divide-conquer approach; graphic processing unit; iterative method; memory resource allocation; multithreaded parallel design; pairwise sequence alignment; protein database; query sequence; scalable Smith-Waterman algorithm; Algorithm design and analysis; Computer architecture; Concurrent computing; Databases; Graphics; Proteins; Resource management; Samarium; Testing; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Application Specific Processors, 2009. SASP '09. IEEE 7th Symposium on
Conference_Location :
San Francisco, CA
Print_ISBN :
978-1-4244-4939-2
Electronic_ISBN :
978-1-4244-4938-5
Type :
conf
DOI :
10.1109/SASP.2009.5226343
Filename :
5226343
Link To Document :
بازگشت