• DocumentCode
    166689
  • Title

    SWAPHI-LS: Smith-Waterman Algorithm on Xeon Phi coprocessors for Long DNA Sequences

  • Author

    Yongchao Liu ; Tuan-Tu Tran ; Lauenroth, Felix ; Schmidt, Benedikt

  • Author_Institution
    Inst. fur Inf., Johannes Gutenberg Univ. Mainz, Mainz, Germany
  • fYear
    2014
  • fDate
    22-26 Sept. 2014
  • Firstpage
    257
  • Lastpage
    265
  • Abstract
    As an optimal method for sequence alignment, the Smith-Waterman (SW) algorithm is widely used. Unfortunately, this algorithm is computationally demanding, especially for long sequences. This has motivated the investigation of its acceleration on a variety of high-performance computing platforms. However, most work in the literature is only suitable for short sequences. In this paper, we present SWAPHI-LS, the first parallel SW algorithm exploiting emerging Xeon Phi coprocessors to accelerate the alignment of long DNA sequences. In SWAPHI-LS, we have investigated three parallelization approaches (naïve, tiled, and distributed) in order to deeply explore the inherent parallelism within Xeon Phis. To achieve high speed, we have explored two levels of parallelism within a single Xeon Phi and one more level of parallelism between Xeon Phis. Within a single Xeon Phi we exploit instruction-level parallelism within 512-bit single instruction multiple data (SIMD) instructions (vectorization) as well as thread-level parallelism over the many cores (multi-threading). Between Xeon Phis we employ device-level parallelism in order to harness the compute power of Xeon Phi clusters (distributed computing based on the MPI offload model). The performance of our algorithm has been evaluated using a variety of genome sequences of lengths ranging from 4.4 million to 50 million nucleotides. Our performance evaluation reveals that our implementation achieves a stable performance of up to 30.1 billion cell updates per second (GCUPS) on a single Xeon Phi and up to 111.4 GCUPS on four Xeon Phis sharing the same host. SWAPHI-LS is written in C++ (with a set of SIMD intrinsics), OpenMP and MPI. The source code is publicly available at http://swaphi-ls.sourceforge.net.
  • Keywords
    C++ language; DNA; bioinformatics; coprocessors; multi-threading; multiprocessing systems; parallel algorithms; C++; MPI offload model; OpenMP; SIMD instructions; SWAPHI-LS; Smith-Waterman Algorithm on Xeon Phi Coprocessors for Long DNA Sequences; Xeon Phi clusters; boinformatics; device-level parallelism; distributed computing; genome sequences; instruction-level parallelism; many cores; multithreading; nucleotides; parallel SW algorithm; single instruction multiple data; source code; thread-level parallelism; vectorization; Buffer storage; Computational modeling; Computer architecture; Instruction sets; Kernel; Parallel processing; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing (CLUSTER), 2014 IEEE International Conference on
  • Conference_Location
    Madrid
  • Type

    conf

  • DOI
    10.1109/CLUSTER.2014.6968772
  • Filename
    6968772