• DocumentCode
    3183909
  • Title

    String Matching in Hardware Using the FM-Index

  • Author

    Fernandez, Edward ; Najjar, Walid ; Lonardi, Stefano

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of California Riverside, Riverside, CA, USA
  • fYear
    2011
  • fDate
    1-3 May 2011
  • Firstpage
    218
  • Lastpage
    225
  • Abstract
    String matching is a ubiquitous problem that arises in a wide range of applications in computing, e.g., packet routing, intrusion detection, web querying, and genome analysis. Due to its importance, dozens of algorithms and several data structures have been developed over the years. A recent breakthrough in this field is the FM-index, a data structure that synergistically combines the Burrows-Wheeler transform and the suffix array. In software, the FM-index allows searching (exact and approximate) in times comparable to the fastest known indices for large texts (suffix trees and suffix arrays), but has the additional advantage of being more space-efficient than those approaches. In this paper, we describe the first FPGA-based hardware implementation of the FM-index for exact pattern matching. We report experimental results on the problem of mapping short DNA sequences to a reference genome. We show that the throughput of the FM-index is significantly higher than the naive (brute force) approach. Like the Bowtie software tool, the FM-index can abandon early the hardware matching. It outperforms Bowtie by two orders of magnitude.
  • Keywords
    data structures; field programmable gate arrays; string matching; Burrows-Wheeler transform; DNA sequences; FM-index; FPGA-based hardware implementation; Web querying; data structures; exact pattern matching; genome analysis; intrusion detection; packet routing; string matching; suffix array; ubiquitous problem; Arrays; Equations; Field programmable gate arrays; Hardware; Indexes; Multiplexing; Transforms; FPGA; Reconfigurable Computing; String matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Field-Programmable Custom Computing Machines (FCCM), 2011 IEEE 19th Annual International Symposium on
  • Conference_Location
    Salt Lake City, UT
  • Print_ISBN
    978-1-61284-277-6
  • Electronic_ISBN
    978-0-7695-4301-7
  • Type

    conf

  • DOI
    10.1109/FCCM.2011.55
  • Filename
    5771277