• DocumentCode
    3460413
  • Title

    Practical and Efficient Algorithms for Degenerate and Weighted Sequences Derived from High Throughput Sequencing Technologies

  • Author

    Antoniou, Pavlos ; Iliopoulos, Costas S. ; Mouchard, Laurent ; Pissis, Solon P.

  • Author_Institution
    Dept. of Comput. Sci., King´´s Coll. London, London, UK
  • fYear
    2009
  • fDate
    3-5 Aug. 2009
  • Firstpage
    174
  • Lastpage
    180
  • Abstract
    High throughput, (or next generation) sequencing technologies have opened new and exciting opportunities in the use of DNA sequences. The new emerging technologies mark the beginning of a new era of high throughput short read sequencing: they have the potential to assemble a bacterial genome during a single experiment and at a moderate cost. In this paper, we address the problem of efficiently mapping millions of degenerate and weighted sequences to a reference genome with respect to whether they occur exactly once in the genome or not, and by taking probability scores into consideration. In particular, we define and solve the Massive Exact and Approximate Unique Pattern Matching problem for degenerate and weighted sequences derived from high throughput sequencing technologies.
  • Keywords
    DNA; bioinformatics; genomics; microorganisms; molecular biophysics; pattern matching; probability; DNA sequences; bacterial genome; mapping algorithm; massive exact-and-approximate unique pattern matching problem; probability scores; weighted sequences; Assembly; Bioinformatics; Computer science; Costs; DNA; Genomics; Microorganisms; Pattern matching; Sequences; Throughput; degenerate sequences; high throughput sequencing technologies; weighted sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics, Systems Biology and Intelligent Computing, 2009. IJCBS '09. International Joint Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3739-9
  • Type

    conf

  • DOI
    10.1109/IJCBS.2009.48
  • Filename
    5260703