Title :
Mapping uniquely occurring short sequences derived from high throughput technologies to a reference genome
Author :
Antoniou, Pavlos ; Daykin, Jackie W. ; Iliopoulos, Costas S. ; Kourie, Derrick ; Mouchard, Laurent ; Pissis, Solon P.
Author_Institution :
King´´s Coll. London, London, UK
Abstract :
Novel high throughput sequencing technology methods have redefined the way genome sequencing is performed. They are able to produce tens of millions of short sequences (reads) in a single experiment and with a much lower cost than previous sequencing methods. Due to this massive amount of data generated by the above systems, efficient algorithms for mapping short sequences to a reference genome are in great demand. In this paper, we present a practical algorithm for addressing the problem of efficiently mapping uniquely occuring short reads to a reference genome. This requires the classification of these short reads into unique and duplicate matches. In particular, we define and solve the Massive Exact Unique Pattern Matching problem in genomes.
Keywords :
biology computing; genomics; molecular biophysics; molecular configurations; pattern classification; pattern matching; genome sequencing; high throughput sequencing; massive exact unique pattern matching problem; reference genome; short sequence mapping; Assembly; Bioinformatics; Computer science; Costs; DNA; Genomics; Hybrid power systems; Pattern matching; Sequences; Throughput; high throughput; mapping; pattern matching; sequencing; short reads;
Conference_Titel :
Information Technology and Applications in Biomedicine, 2009. ITAB 2009. 9th International Conference on
Conference_Location :
Larnaca
Print_ISBN :
978-1-4244-5379-5
Electronic_ISBN :
978-1-4244-5379-5
DOI :
10.1109/ITAB.2009.5394394