Title :
Parallelizing Epistasis Detection in GWAS on FPGA and GPU-Accelerated Computing Systems
Author :
Gonzalez-Dominguez, Jorge ; Wienbrandt, Lars ; Kassens, Jan Christian ; Ellinghaus, David ; Schimmler, Manfred ; Schmidt, Bertil
Author_Institution :
Inst. of Comput. Sci., Johannes Gutenberg Univ. Mainz, Mainz, Germany
Abstract :
High-throughput genotyping technologies (such as SNP-arrays) allow the rapid collection of up to a few million genetic markers of an individual. Detecting epistasis (based on 2-SNP interactions) in Genome-Wide Association Studies is an important but time consuming operation since statistical computations have to be performed for each pair of measured markers. Computational methods to detect epistasis therefore suffer from prohibitively long runtimes; e.g., processing a moderately-sized dataset consisting of about 500,000 SNPs and 5,000 samples requires several days using state-of-the-art tools on a standard 3 GHz CPU. In this paper, we demonstrate how this task can be accelerated using a combination of fine-grained and coarse-grained parallelism on two different computing systems. The first architecture is based on reconfigurable hardware (FPGAs) while the second architecture uses multiple GPUs connected to the same host. We show that both systems can achieve speedups of around four orders-of-magnitude compared to the sequential implementation. This significantly reduces the runtimes for detecting epistasis to only a few minutes for moderatelysized datasets and to a few hours for large-scale datasets.
Keywords :
biology computing; field programmable gate arrays; genetics; genomics; graphics processing units; parallel algorithms; statistical analysis; FPGA; GPU-accelerated computing systems; GWAS; Genome-Wide Association Studies; SNP; coarse-grained parallelism; epistasis detection; fine-grained parallelism; frequency 3 GHz; genetic markers; high-throughput genotyping technologies; large-scale datasets; orders-of-magnitude; reconfigurable hardware; statistical computations; Bioinformatics; Computational biology; Computer architecture; Field programmable gate arrays; Graphics processing units; Random access memory; FPGA; GPU; GWAS; contingency tables; epistasis; pairwise gene-gene interaction; parallel computing;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2015.2389958