DocumentCode
676355
Title
An acceleration method of short read mapping using FPGA
Author
Sogabe, Yusuke ; Maruyama, Tetsuhiro
Author_Institution
Syst. & Inf. Eng., Univ. of Tsukuba, Tsukuba, Japan
fYear
2013
fDate
9-11 Dec. 2013
Firstpage
350
Lastpage
353
Abstract
The rapid development of Next Generation Sequencing (NGS) has enabled to generate more than 100G base pairs per day from one machine. The produced data are randomly fragmented DNA base pair strings, called short reads, and millions of short reads are mapped onto the reference genomes, which are complete genetic sequences, to reconstruct the sequence of the sample DNA. This short read mapping is becoming the bottle-neck of NGS systems. In this paper, we propose an FPGA system for the mapping based on a hash-index method. In our system, short reads are divided into seeds, which are fixed-length substrings used for the mapping, and the seeds are sorted using buckets. Then, the seeds in each bucket are compared in parallel with the candidate locations. With this approach, many seeds can be compared in massively parallel manner with their candidate locations, and it becomes possible to improve the processing speed by reducing the number of the random accesses to DRAM banks which store the candidate locations. Furthermore, substitutions of the nucleotides in a seed can be allowed in this parallel comparison. This makes it possible to achieve higher matching rates than previous works.
Keywords
DNA; bioinformatics; field programmable gate arrays; file organisation; genomics; DNA base pair strings; DRAM banks; FPGA system; NGS; candidate locations; fixed-length substrings; hash-index method; next generation sequencing; nucleotides; parallel comparison; random accesses; reference genomes; sample DNA; short read mapping; Accuracy; DNA; Field programmable gate arrays; Genomics; Indexes; Random access memory; Registers;
fLanguage
English
Publisher
ieee
Conference_Titel
Field-Programmable Technology (FPT), 2013 International Conference on
Conference_Location
Kyoto
Print_ISBN
978-1-4799-2199-7
Type
conf
DOI
10.1109/FPT.2013.6718385
Filename
6718385
Link To Document