DocumentCode
2214651
Title
Exploration of Short Reads Genome Mapping in Hardware
Author
Fernandez, Edward ; Najjar, Walid ; Harris, Elena ; Lonardi, Stefano
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of California, Riverside, Riverside, CA, USA
fYear
2010
fDate
Aug. 31 2010-Sept. 2 2010
Firstpage
360
Lastpage
363
Abstract
The newest generation of sequencing instruments, such as Illumina/Solexa Genome Analyzer and ABI SOLiD, can generate hundreds of millions of short DNA “reads” from a single run. These reads must be matched against a reference genome to identify their original location. Due to sequencing errors or variations in the sequenced genome, the matching procedure must allow a variable but limited number of mismatches. This problem is a version of the classic approximate string matching where a long text is searched for the occurrence of a set of short patterns. Typical strategies to speed up the matching involve elaborate hashing schemes that exploit the inherent repetitions of the data. However, such large data structures are not well suited for FPGA implementations. In this paper we evaluate an FPGA implementation that uses a “naive” approach which checks every possible read-genome alignment. We compare the performance of the naive approach to popular software tools currently used to map short reads to a reference genome showing a speedup of up to 4X over the fastest software tool.
Keywords
bioinformatics; cellular biophysics; field programmable gate arrays; genomics; logic design; parallel architectures; string matching; ABI SOLiD; FPGA implementation; Illumina-Solexa genome analyzer; genome mapping; hashing schemes; large data structures; read-genome alignment; sequenced genome; sequencing errors; software tools; string matching; Reconfigurable computing; bioinformatics; component; string-matching;
fLanguage
English
Publisher
ieee
Conference_Titel
Field Programmable Logic and Applications (FPL), 2010 International Conference on
Conference_Location
Milano
ISSN
1946-1488
Print_ISBN
978-1-4244-7842-2
Type
conf
DOI
10.1109/FPL.2010.78
Filename
5694277
Link To Document