Title :
Optimizing Burrows-Wheeler Transform-Based Sequence Alignment on Multicore Architectures
Author :
Jing Zhang ; Heshan Lin ; Balaji, Pavan ; Wu-Chun Feng
Abstract :
Computational biology sequence alignment tools using the Burrows-Wheeler Transform (BWT) are widely used in next-generation sequencing (NGS) analysis. However, despite extensive optimization efforts, the performance of these tools still cannot keep up with the explosive growth of sequencing data. Through an in-depth performance analysis of BWA, a popular BWT-based aligner on multicore architectures, we demonstrate that such tools are limited by memory bandwidth due to their irregular memory access patterns. We then propose a locality-aware implementation of BWA that aims at optimizing its performance by better exploiting the caching mechanisms of modern multicore processors. Experimental results show that our improved BWA implementation can reduce last-level cache (LLC) misses by 30% and translation look aside buffer (TLB) misses by 20%, resulting in up to 2.6-fold speedup over the original BWA implementation.
Keywords :
biology computing; cache storage; microprocessor chips; multiprocessing systems; optimisation; transforms; BWT-based aligner; Burrows-Wheeler transform-based sequence alignment optimization; LLC; NGS; TLB; caching mechanisms; computational biology sequence alignment tools; irregular memory access patterns; last-level cache misses; multicore architectures; multicore processors; next-generation sequencing analysis; translation lookaside buffer misses; Algorithm design and analysis; Bioinformatics; Data structures; Genomics; Multicore processing; Program processors; Sequential analysis; Burrows-Wheeler transform; FM-index; bioinformatics; data locality; multicore; short-read mapping;
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2013 13th IEEE/ACM International Symposium on
Conference_Location :
Delft
Print_ISBN :
978-1-4673-6465-2
DOI :
10.1109/CCGrid.2013.67