Title :
GGAKE: GPU Based Genome Assembly Using K-Mer Extension
Author :
Garg, Adesh ; Jain, Abhishek ; Paul, Kolin
Author_Institution :
Dept. of Comput. Sci. & Eng., IIT Delhi, New Delhi, India
Abstract :
The genome assembly problem involves constructing the complete genome sequence from the reads generated by the sequencers. The Next Generation Sequencing (NGS) platforms produce a large number of short reads at a very low cost. Many assemblers have been developed to work with NGS reads. The assembly process is computation intensive and also requires a large amount of memory to store the reads. Numerous efforts are being made in recent times to parallelize the assembly process in order to reduce computation time. In this paper we present the design and development of a GPU based genome assembler (GGAKE). Our assembler works using the concept of k-mer extension. Prefix and suffix k-mers are spotted out of every read. Suffix k-mers are matched with prefix k-mers and extensions for every read are noted. Contigs are generated by extending the reads. We have implemented GGAKE on Nvidia´s GPU using the CUDA programming model and benchmarked it on five bacterial genomes. Our results prove that at high coverage GGAKE is capable of producing good quality assembly in a small amount of time.
Keywords :
biology computing; cellular biophysics; genomics; graphics processing units; microorganisms; parallel architectures; parallel programming; CUDA programming model; GGAKE; GPU based genome assembler; GPU based genome assembly; NGS reads; Nvidia´s GPU; assembly process parallelization; bacterial genomes; contig generation; genome sequence; k-mer extension; next generation sequencing platforms; Assembly; Bioinformatics; DNA; Encoding; Genomics; Graphics processing units; Sorting; GPU; bioinformatics; genome assembly; parallel processing;
Conference_Titel :
High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
Conference_Location :
Zhangjiajie
DOI :
10.1109/HPCC.and.EUC.2013.156