• DocumentCode
    2917205
  • Title

    Multithreaded FPGA acceleration of DNA sequence mapping

  • Author

    Fernandez, Eduardo B. ; Najjar, Walid ; Lonardi, Stefano ; Villarreal, Jesse

  • Author_Institution
    Univ. of California Riverside, Riverside, CA, USA
  • fYear
    2012
  • fDate
    10-12 Sept. 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In bioinformatics, short read alignment is a computationally intensive operation that involves matching millions of short strings (called reads) against a reference genome. At the time of writing, a representative run requires to match tens of millions of reads of length of about 100 symbols against a genome that can consists of a few billion characters. Existing short read aligners are expected to report all the occurrences of each read as well as allow users to control the number of allowed mismatches between reads and reference genome. Popular software implementations such as Bowtie [8] or BWA [10] can take many hours or days to execute, making the problem an ideal candidate for hardware acceleration. In this paper, we describe FHAST (FPGA Hardware Accelerated Sequencing-matching Tool), a hardware accelerator that acts as a drop-in replacement for short read alignment software. Our architecture masks memory latency by executing many concurrent hardware threads accessing memory simultaneously and consists of multiple parallel engines to exploit the parallelism available to us on an FPGA. We have implemented and tested FHAST on the Convey HC-1 [9], taking advantage of the large amount of memory bandwidth available to the system and the shared memory image between hardware and software. By comparing the performance of FHAST against Bowtie on the Convey HC-1 we observed up to ~70X improvement in total end-to-end execution time, reducing runs that take several hours to a few minutes. We also favorably compare the rate of growth when expanding FHAST to utilize multiple FPGAs against multiple CPUs in Bowtie.
  • Keywords
    DNA; bioinformatics; field programmable gate arrays; genomics; string matching; Convey HC-1; DNA sequence mapping; FHAST; FPGA hardware accelerated sequencing-matching tool; bioinformatics; computationally intensive operation; drop-in replacement; end-to-end execution time; hardware accelerator; memory latency; multithreaded FPGA acceleration; parallel engines; read aligners; reference genome; short read alignment; short read alignment software; string matching; Bioinformatics; Engines; Field programmable gate arrays; Genomics; Hardware; Instruction sets; FPGA; bioinformatics; hardware acceleration; multithreaded; short read matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Extreme Computing (HPEC), 2012 IEEE Conference on
  • Conference_Location
    Waltham, MA
  • Print_ISBN
    978-1-4673-1577-7
  • Type

    conf

  • DOI
    10.1109/HPEC.2012.6408669
  • Filename
    6408669