DocumentCode :
583228
Title :
De novo co-assembly of bacterial genomes from multiple single cells
Author :
Movahedi, Narjes S. ; Forouzmand, Elmirasadat ; Chitsaz, Hamidreza
Author_Institution :
Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA
fYear :
2012
fDate :
4-7 Oct. 2012
Firstpage :
1
Lastpage :
5
Abstract :
Recent progress in DNA amplification techniques, particularly multiple displacement amplification (MDA), has made it possible to sequence and assemble bacterial genomes from a single cell. However, the quality of single cell genome assembly has not yet reached the quality of normal multicell genome assembly due to the coverage bias and errors caused by MDA. Using a template of more than one cell for MDA or combining separate MDA products has been shown to improve the result of genome assembly from a few single cells, but providing identical single cells, as a necessary step for these approaches, is a challenge. As a solution to this problem, we give an algorithm for de novo co-assembly of bacterial genomes from multiple single cells. Our novel method not only detects the outlier cells in a pool but also identifies and eliminates their genomic sequences from the final assembly. Our proposed co-assembly algorithm is based on the colored de Bruijn graph, which was recently proposed for de novo structural variation detection. Our results show that de novo co-assembly of bacterial genomes from multiple single cells outperforms single cell assembly of each individual cell in all standard metrics. Moreover, co-assembly outperforms mixed assembly, in which the input datasets are simply concatenated. We implemented our algorithm in a software tool called HyDA, which is available from http://compbio.cs.wayne.edu/software/hyda.
Keywords :
DNA; bioinformatics; cellular biophysics; genetics; genomics; microorganisms; molecular biophysics; molecular configurations; self-assembly; Bruijn graph; DNA amplification; HyDA software tool; bacterial genome assembly; bacterial genome sequence; de novo coassembly; de novo structural variation detection; input datasets; multiple displacement amplification; multiple single cells; normal multicell genome assembly; single cell genome assembly; standard metrics; Assembly; Bioinformatics; Color; DNA; Genomics; Image color analysis; Microorganisms; colored de Bruijn graph; sequence assembly; single cell genomics;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-1-4673-2559-2
Electronic_ISBN :
978-1-4673-2558-5
Type :
conf
DOI :
10.1109/BIBM.2012.6392618
Filename :
6392618