• DocumentCode
    260366
  • Title

    Effect of Multi-K Contig Merging in de novo DNA Assembly

  • Author

    Goodarzi, Mohammad ; Houghten, Sheridan ; Ping Liang

  • Author_Institution
    Dept. of Comput. Sci., Brock Univ., St. Catharines, ON, Canada
  • fYear
    2014
  • fDate
    10-12 Nov. 2014
  • Firstpage
    355
  • Lastpage
    361
  • Abstract
    DNA Assembly is among the most fundamental and challenging problems in bioinformatics. Near optimal solutions are available for bacterial and small genomes. However assembling large and complex genomes including the human genome using Next-Generation-Sequencing (NGS) technologies is shown to be very difficult. This paper presents an algorithm for creating contigs from NGS short read data that is capable of working with multiple k-mer lengths and introduces a technique to combine contigs generated from different k runs with results from other assemblers in order to obtain significantly better assemblies. Experimental results from 9 real datasets show an increase in N50 value by a factor of 3, when combining newly created contigs with results from other assemblers.
  • Keywords
    DNA; bioinformatics; genomics; merging; molecular biophysics; molecular configurations; bacterial genomes; bioinformatics; de novo DNA assembly; human genome; multik contig merging; multiple k-mer length; next-generation-sequencing technology; Assembly; Bioinformatics; DNA; Educational institutions; Genomics; Memory management; Merging; Contig Merging; Multi-K Assembly; de novo DNA Assembly;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering (BIBE), 2014 IEEE International Conference on
  • Conference_Location
    Boca Raton, FL
  • Type

    conf

  • DOI
    10.1109/BIBE.2014.49
  • Filename
    7033606