GPU-RMAP: Accelerating Short-Read Mapping on Graphics Processors

Author

Aji, Ashwin M. ; Zhang, Liqing ; Feng, Wu-chun

Author_Institution

Dept. of Comput. Sci., Virginia Tech, Blacksburg, VA, USA

fYear

2010

fDate

11-13 Dec. 2010

Firstpage

168

Lastpage

175

Abstract

Next-generation, high-throughput sequencers are now capable of producing hundreds of billions of short sequences (reads) in a single day. The task of accurately mapping the reads back to a reference genome is of particular importance because it is used in several other biological applications, e.g., genome re-sequencing, DNA methylation, and ChiP sequencing. On a personal computer (PC), the computationally intensive short-read mapping task currently requires several hours to execute while working on very large sets of reads and genomes. Accelerating this task requires parallel computing. Among the current parallel computing platforms, the graphics processing unit (GPU) provides massively parallel computational prowess that holds the promise of accelerating scientific applications at low cost. In this paper, we propose GPU-RMAP, a massively parallel version of the RMAP short-read mapping tool that is highly optimized for the NVIDIA family of GPUs. We then evaluate GPU-RMAP by mapping millions of synthetic and real reads of varying widths on the mosquito (Aedes aegypti) and human genomes. We also discuss the effects of various input parameters, such as read width, number of reads, and chromosome size, on the performance of GPU-RMAP. We then show that despite using the conventionally “slower” but GPU-compatible binary search algorithm, GPU-RMAP outperforms the sequential RMAP implementation, which uses the “faster” hashing technique on a PC. Our data-parallel GPU implementation results in impressive speedups of up to 14:5-times for the mapping kernel and up to 9:6-times for the overall program execution time over the sequential RMAP implementation on a traditional PC.

Keywords

computer graphic equipment; coprocessors; file organisation; parallel processing; search problems; GPU; GPU compatible binary search algorithm; GPU-RMAP; NVIDIA family; RMAP short read mapping tool; computationally intensive short read mapping task; data parallel GPU implementation; graphics processing unit; hashing technique; high throughput sequencer; massively parallel computational prowess; parallel computing; sequential RMAP implementation; short read mapping acceleration; Algorithm design and analysis; Bioinformatics; Genomics; Graphics processing unit; Instruction sets; Kernel; Table lookup; CUDA; RMAP; graphics processing unit (GPU); sequence analysis; short-read mapping;

fLanguage

English

Publisher

ieee

Conference_Titel

Computational Science and Engineering (CSE), 2010 IEEE 13th International Conference on

Conference_Location

Hong Kong

Print_ISBN

978-1-4244-9591-7

Electronic_ISBN

978-0-7695-4323-9

Type

conf

DOI

10.1109/CSE.2010.29

Filename

5692471