DocumentCode
3411018
Title
An exhaustive genome assembly algorithm using k-mers to indirectly perform N-squared comparisons in O(N)
Author
Shah, Maulik K. ; Lee, HoJoon ; Rogers, Stephanie A. ; Touchman, Jeffrey W.
Author_Institution
Arizona State Univ., Tempe, AZ, USA
fYear
2004
fDate
16-19 Aug. 2004
Firstpage
740
Lastpage
741
Abstract
We present an algorithm that indirectly makes N2 sequence comparisons in O(N) with respect to the size of the genome. This algorithm is very applicable in assembling whole genomes from the thousands of DNA sequence fragments that are generated in shotgun sequencing. First, we assume that fragments that share k-mers should overlap in the final assembly. We then catalog all k-mers that exist in the shotgun library and infer links between fragments that share k-mers. These links are then used to represent edges in a graph. This graph is generated in O(N) yet represents the result of comparing every fragment to every other fragment.
Keywords
DNA; biology computing; genetics; graphs; molecular biophysics; DNA sequence; N-squared comparisons; O(N); exhaustive genome assembly algorithm; graph; k-mers; shotgun sequencing; Assembly; Bioinformatics; Capacitive sensors; DNA; Data structures; Genomics; Libraries; Sequences; Windows;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN
0-7695-2194-0
Type
conf
DOI
10.1109/CSB.2004.1332565
Filename
1332565
Link To Document