DocumentCode :
2175653
Title :
GPU-RMAP: Accelerating Short-Read Mapping on Graphics Processors
Author :
Aji, Ashwin M. ; Zhang, Liqing ; Feng, Wu-chun
Author_Institution :
Dept. of Comput. Sci., Virginia Tech, Blacksburg, VA, USA
fYear :
2010
fDate :
11-13 Dec. 2010
Firstpage :
168
Lastpage :
175
Abstract :
Next-generation, high-throughput sequencers are now capable of producing hundreds of billions of short sequences (reads) in a single day. The task of accurately mapping the reads back to a reference genome is of particular importance because it is used in several other biological applications, e.g., genome re-sequencing, DNA methylation, and ChiP sequencing. On a personal computer (PC), the computationally intensive short-read mapping task currently requires several hours to execute while working on very large sets of reads and genomes. Accelerating this task requires parallel computing. Among the current parallel computing platforms, the graphics processing unit (GPU) provides massively parallel computational prowess that holds the promise of accelerating scientific applications at low cost. In this paper, we propose GPU-RMAP, a massively parallel version of the RMAP short-read mapping tool that is highly optimized for the NVIDIA family of GPUs. We then evaluate GPU-RMAP by mapping millions of synthetic and real reads of varying widths on the mosquito (Aedes aegypti) and human genomes. We also discuss the effects of various input parameters, such as read width, number of reads, and chromosome size, on the performance of GPU-RMAP. We then show that despite using the conventionally “slower” but GPU-compatible binary search algorithm, GPU-RMAP outperforms the sequential RMAP implementation, which uses the “faster” hashing technique on a PC. Our data-parallel GPU implementation results in impressive speedups of up to 14:5-times for the mapping kernel and up to 9:6-times for the overall program execution time over the sequential RMAP implementation on a traditional PC.
Keywords :
computer graphic equipment; coprocessors; file organisation; parallel processing; search problems; GPU; GPU compatible binary search algorithm; GPU-RMAP; NVIDIA family; RMAP short read mapping tool; computationally intensive short read mapping task; data parallel GPU implementation; graphics processing unit; hashing technique; high throughput sequencer; massively parallel computational prowess; parallel computing; sequential RMAP implementation; short read mapping acceleration; Algorithm design and analysis; Bioinformatics; Genomics; Graphics processing unit; Instruction sets; Kernel; Table lookup; CUDA; RMAP; graphics processing unit (GPU); sequence analysis; short-read mapping;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Science and Engineering (CSE), 2010 IEEE 13th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-9591-7
Electronic_ISBN :
978-0-7695-4323-9
Type :
conf
DOI :
10.1109/CSE.2010.29
Filename :
5692471
Link To Document :
بازگشت