• DocumentCode
    2606826
  • Title

    FMSA: FPGA-Accelerated ClustalW-Based Multiple Sequence Alignment through Pipelined Prefiltering

  • Author

    Mahram, Atabak ; Herbordt, Martin C.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Boston Univ., Boston, MA, USA
  • fYear
    2012
  • fDate
    April 29 2012-May 1 2012
  • Firstpage
    177
  • Lastpage
    183
  • Abstract
    Multiple Sequence Alignment (MSA) is perhaps second only to sequence alignment in overall importance in Bioinformatics, being critical, e.g., in determining the structure and function of molecules from putative families of sequences. But while pair wise sequence alignment has been the subject of scores of FPGA acceleration studies, MSA only a few. The most important of these accelerate Clustal-W, the most commonly used MSA code, by either implementing the first of three phases (over 90% of the run time) with Dynamic Programming (DP) methods, or by accelerating the third phase which consumes most of the remaining time. We use a new approach: we apply prefiltering of the kind commonly used in BLAST to perform the initial all-pairs alignments. This results in a speedup of from 80× to 190× over the CPU code (8 cores) and speedup of from 2.5× to 8× over DP/FPGA- and GPU-based methods. When combined with a recently published method for phase 3, and using the original software for phase 2, the end-to-end speedup is at least 50× over an 8-core implementation of the original code. The quality is comparable to the original according to a commonly used benchmark suite evaluated with respect to multiple distance metrics.
  • Keywords
    dynamic programming; field programmable gate arrays; pipelines; 8-core implementation; BLAST; CPU code; DP method; DP-FPGA- methods; DP/FPGA-; FMSA; FPGA-accelerated ClustalW-based multiple sequence alignment; GPU-based methods; MSA code; bioinformatics; dynamic programming method; initial all-pairs alignments; molecule functions; pairwise sequence alignment; pipelined prefiltering; putative families; Acceleration; Biological information theory; Databases; Emulation; Field programmable gate arrays; Silicon; Standards; Bioinformatics; High Performance Reconfigurable Computing; Multiple Sequence Alignment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Field-Programmable Custom Computing Machines (FCCM), 2012 IEEE 20th Annual International Symposium on
  • Conference_Location
    Toronto, ON
  • Print_ISBN
    978-1-4673-1605-7
  • Type

    conf

  • DOI
    10.1109/FCCM.2012.38
  • Filename
    6239811