• DocumentCode
    688265
  • Title

    GGAKE: GPU Based Genome Assembly Using K-Mer Extension

  • Author

    Garg, Adesh ; Jain, Abhishek ; Paul, Kolin

  • Author_Institution
    Dept. of Comput. Sci. & Eng., IIT Delhi, New Delhi, India
  • fYear
    2013
  • fDate
    13-15 Nov. 2013
  • Firstpage
    1105
  • Lastpage
    1112
  • Abstract
    The genome assembly problem involves constructing the complete genome sequence from the reads generated by the sequencers. The Next Generation Sequencing (NGS) platforms produce a large number of short reads at a very low cost. Many assemblers have been developed to work with NGS reads. The assembly process is computation intensive and also requires a large amount of memory to store the reads. Numerous efforts are being made in recent times to parallelize the assembly process in order to reduce computation time. In this paper we present the design and development of a GPU based genome assembler (GGAKE). Our assembler works using the concept of k-mer extension. Prefix and suffix k-mers are spotted out of every read. Suffix k-mers are matched with prefix k-mers and extensions for every read are noted. Contigs are generated by extending the reads. We have implemented GGAKE on Nvidia´s GPU using the CUDA programming model and benchmarked it on five bacterial genomes. Our results prove that at high coverage GGAKE is capable of producing good quality assembly in a small amount of time.
  • Keywords
    biology computing; cellular biophysics; genomics; graphics processing units; microorganisms; parallel architectures; parallel programming; CUDA programming model; GGAKE; GPU based genome assembler; GPU based genome assembly; NGS reads; Nvidia´s GPU; assembly process parallelization; bacterial genomes; contig generation; genome sequence; k-mer extension; next generation sequencing platforms; Assembly; Bioinformatics; DNA; Encoding; Genomics; Graphics processing units; Sorting; GPU; bioinformatics; genome assembly; parallel processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
  • Conference_Location
    Zhangjiajie
  • Type

    conf

  • DOI
    10.1109/HPCC.and.EUC.2013.156
  • Filename
    6832038