• DocumentCode
    1785223
  • Title
    A fast pattern matching algorithm for highly similar sequences
  • Author
    Ben Nsira, Nadia; Lecroq, Thierry; Elloumi, Mourad
  • Author_Institution
    LITIS, Normandie Univ., Rouen, France
  • fYear
    2014
  • fDate
    2-5 Nov. 2014
  • Firstpage
    32
  • Lastpage
    38
  • Abstract
    With the advent of next-generation sequencing (NGS) technologies, more and more genomic sequences of individuals of the same species are available. These sequences differ only by a very small amount. There is thus a strong need for efficient algorithms that perform fast pattern matching in such specific sets of sequences. In this paper, we propose a very efficient algorithm that solves the on-line exact pattern matching problem in a set of highly similar DNA sequences. The algorithm extends variants of the Boyer-Moore exact string matching algorithm. Experimental results show that our new algorithm exhibits the best performance in practice.
  • Keywords
    DNA; bioinformatics; genomics; molecular biophysics; molecular configurations; string matching; Boyer-Moore exact string matching algorithm; NGS technologies; fast pattern matching algorithm; genomic sequences; highly similar DNA sequences; on-line exact pattern matching problem; Arrays; DNA; Educational institutions; Genomics; Hamming distance; Pattern matching
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
  • Conference_Location
    Belfast
  • Type
    conf
  • DOI
    10.1109/BIBM.2014.6999384
  • Filename
    6999384