DocumentCode :
260366
Title :
Effect of Multi-K Contig Merging in de novo DNA Assembly
Author :
Goodarzi, Mohammad ; Houghten, Sheridan ; Ping Liang
Author_Institution :
Dept. of Comput. Sci., Brock Univ., St. Catharines, ON, Canada
fYear :
2014
fDate :
10-12 Nov. 2014
Firstpage :
355
Lastpage :
361
Abstract :
DNA Assembly is among the most fundamental and challenging problems in bioinformatics. Near optimal solutions are available for bacterial and small genomes. However assembling large and complex genomes including the human genome using Next-Generation-Sequencing (NGS) technologies is shown to be very difficult. This paper presents an algorithm for creating contigs from NGS short read data that is capable of working with multiple k-mer lengths and introduces a technique to combine contigs generated from different k runs with results from other assemblers in order to obtain significantly better assemblies. Experimental results from 9 real datasets show an increase in N50 value by a factor of 3, when combining newly created contigs with results from other assemblers.
Keywords :
DNA; bioinformatics; genomics; merging; molecular biophysics; molecular configurations; bacterial genomes; bioinformatics; de novo DNA assembly; human genome; multik contig merging; multiple k-mer length; next-generation-sequencing technology; Assembly; Bioinformatics; DNA; Educational institutions; Genomics; Memory management; Merging; Contig Merging; Multi-K Assembly; de novo DNA Assembly;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Bioengineering (BIBE), 2014 IEEE International Conference on
Conference_Location :
Boca Raton, FL
Type :
conf
DOI :
10.1109/BIBE.2014.49
Filename :
7033606
Link To Document :
بازگشت