Title :
GWISFI: A universal GPU interface for exhaustive search of pairwise interactions in case-control GWAS in minutes
Author :
Qiao Wang ; Fan Shi ; Kowalczyk, Adam ; Campbell, Richard M. ; Goudey, Benjamin ; Rawlinson, David ; Harwood, Aaron ; Ferra, Herman ; Kowalczyk, Adam
Author_Institution :
NICTA Victorian Res. Lab., Univ. of Melbourne, Melbourne, VIC, Australia
Abstract :
Epistatic interactions between genes are believed to be a critical component in the genetic architecture of complex diseases. Genome Wide Association Studies (GWAS) may be able to detect such genetic interactions indirectly, via the identification of associated SNP markers. Major obstacles to progress in this area are: the unknown nature of epistatic interactions, little understanding of the capabilities of different filtering methods, and the computational difficulties for exhaustive analysis. A common platform enabling various detection methods is needed to avoid practical issues such as software compatibility and portability, incompatible input and output formats and varying demands on computational resources. We developed a highly optimised GPU system capable of exhaustively analysing all SNP-pairs in typical GWAS data (0.5M SNPs, 5K samples) in a few minutes on a standard desktop computer. A number of programming elements provided by a functional interface can be used to construct user-defined statistical tests to efficiently score every SNP pair. As a proof of principle, we have implemented 8 methods from the literature via our interface. We have applied all of them using a single GPU to exhaustively scan the 7 popular WTCCC case-control GWAS datasets. We present timing results for these methods, both in their original software implementations and using our platform. Significant improvements in timing are observed, up to 10000 times for CPU implementations of the popular FastEpistasis in PLINK and up to 2 orders of magnitude for some GPU implementations in the literature. As an initial discovery we show plots for overlaps of list of selected pairs by 8 algorithms for Type 2 Diabetes, WTCCC data.
Keywords :
bioinformatics; diseases; genetics; genomics; graphical user interfaces; graphics processing units; FastEpistasis; GPU implementation; GWISFI; Genome Wide Association Studies; PLINK; SNP-pairs; Type 2 Diabetes; WTCCC case-control GWAS datasets; WTCCC data; associated SNP marker identification; complex diseases; computational difficulties; computational resources; detection methods; epistatic interactions; exhaustive analysis; filtering methods; functional interface; genetic architecture; genetic interactions; highly optimised GPU system; incompatible input formats; incompatible output formats; orders of magnitude; original software implementations; pairwise interactions; portability; programming elements; single GPU; software compatibility; standard desktop computer; universal GPU interface; user-defined statistical tests; Diseases; Graphics processing units; Kernel; Runtime; Software algorithms; Standards; GWISFI: bioinformatics.research.nicta.com.au/gwisfi;
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
Conference_Location :
Belfast
DOI :
10.1109/BIBM.2014.6999192