DocumentCode
3183909
Title
String Matching in Hardware Using the FM-Index
Author
Fernandez, Edward ; Najjar, Walid ; Lonardi, Stefano
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of California Riverside, Riverside, CA, USA
fYear
2011
fDate
1-3 May 2011
Firstpage
218
Lastpage
225
Abstract
String matching is a ubiquitous problem that arises in a wide range of applications in computing, e.g., packet routing, intrusion detection, web querying, and genome analysis. Due to its importance, dozens of algorithms and several data structures have been developed over the years. A recent breakthrough in this field is the FM-index, a data structure that synergistically combines the Burrows-Wheeler transform and the suffix array. In software, the FM-index allows searching (exact and approximate) in times comparable to the fastest known indices for large texts (suffix trees and suffix arrays), but has the additional advantage of being more space-efficient than those approaches. In this paper, we describe the first FPGA-based hardware implementation of the FM-index for exact pattern matching. We report experimental results on the problem of mapping short DNA sequences to a reference genome. We show that the throughput of the FM-index is significantly higher than the naive (brute force) approach. Like the Bowtie software tool, the FM-index can abandon early the hardware matching. It outperforms Bowtie by two orders of magnitude.
Keywords
data structures; field programmable gate arrays; string matching; Burrows-Wheeler transform; DNA sequences; FM-index; FPGA-based hardware implementation; Web querying; data structures; exact pattern matching; genome analysis; intrusion detection; packet routing; string matching; suffix array; ubiquitous problem; Arrays; Equations; Field programmable gate arrays; Hardware; Indexes; Multiplexing; Transforms; FPGA; Reconfigurable Computing; String matching;
fLanguage
English
Publisher
ieee
Conference_Titel
Field-Programmable Custom Computing Machines (FCCM), 2011 IEEE 19th Annual International Symposium on
Conference_Location
Salt Lake City, UT
Print_ISBN
978-1-61284-277-6
Electronic_ISBN
978-0-7695-4301-7
Type
conf
DOI
10.1109/FCCM.2011.55
Filename
5771277
Link To Document