• DocumentCode
    3600901
  • Title

    Boosting the FM-Index on the GPU: Effective Techniques to Mitigate Random Memory Access

  • Author

    Chacon, Alejandro ; Marco-Sola, Santiago ; Espinosa, Antonio ; Ribeca, Paolo ; Moure, Juan Carlos

  • Author_Institution
    Comput. Archit. & Oper. Syst., Univ. Autonoma de Barcelona, Bellaterra, Spain
  • Volume
    12
  • Issue
    5
  • fYear
    2015
  • Firstpage
    1048
  • Lastpage
    1059
  • Abstract
    The recent advent of high-throughput sequencing machines producing big amounts of short reads has boosted the interest in efficient string searching techniques. As of today, many mainstream sequence alignment software tools rely on a special data structure, called the FM-index, which allows for fast exact searches in large genomic references. However, such searches translate into a pseudo-random memory access pattern, thus making memory access the limiting factor of all computation-efficient implementations, both on CPUs and GPUs. Here, we show that several strategies can be put in place to remove the memory bottleneck on the GPU: more compact indexes can be implemented by having more threads work cooperatively on larger memory blocks, and a k-step FM-index can be used to further reduce the number of memory accesses. The combination of those and other optimisations yields an implementation that is able to process about two Gbases of queries per second on our test platform, being about 8× faster than a comparable multi-core CPU version, and about 3× to 5× faster than the FM-index implementation on the GPU provided by the recently announced Nvidia NVBIO bioinformatics library.
  • Keywords
    bioinformatics; data structures; genomics; graphics processing units; random-access storage; GPU; Nvidia NVBIO bioinformatics library; data structure; genomic references; high-throughput sequencing machines; k-step FM-index; mainstream sequence alignment software tools; multicore CPU version; pseudorandom memory access pattern; string searching techniques; Bioinformatics; Frequency modulation; Genomics; Graphics processing units; Indexes; Instruction sets; Radiation detectors; Bioinformatics; FM-index; Fine-Grain Parallelism; GPGPU; Memory-Level Parallelism; Short Read Mapping; bioinformatics; fine-grain parallelism; memory-level parallelism; short read mapping;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2014.2377716
  • Filename
    6975110