DocumentCode :
3411018
Title :
An exhaustive genome assembly algorithm using k-mers to indirectly perform N-squared comparisons in O(N)
Author :
Shah, Maulik K. ; Lee, HoJoon ; Rogers, Stephanie A. ; Touchman, Jeffrey W.
Author_Institution :
Arizona State Univ., Tempe, AZ, USA
fYear :
2004
fDate :
16-19 Aug. 2004
Firstpage :
740
Lastpage :
741
Abstract :
We present an algorithm that indirectly makes N2 sequence comparisons in O(N) with respect to the size of the genome. This algorithm is very applicable in assembling whole genomes from the thousands of DNA sequence fragments that are generated in shotgun sequencing. First, we assume that fragments that share k-mers should overlap in the final assembly. We then catalog all k-mers that exist in the shotgun library and infer links between fragments that share k-mers. These links are then used to represent edges in a graph. This graph is generated in O(N) yet represents the result of comparing every fragment to every other fragment.
Keywords :
DNA; biology computing; genetics; graphs; molecular biophysics; DNA sequence; N-squared comparisons; O(N); exhaustive genome assembly algorithm; graph; k-mers; shotgun sequencing; Assembly; Bioinformatics; Capacitive sensors; DNA; Data structures; Genomics; Libraries; Sequences; Windows;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN :
0-7695-2194-0
Type :
conf
DOI :
10.1109/CSB.2004.1332565
Filename :
1332565
Link To Document :
بازگشت