DocumentCode
1785223
Title
A fast pattern matching algorithm for highly similar sequences
Author
Ben Nsira, Nadia ; Lecroq, Thierry ; Elloumi, Mourad
Author_Institution
LITIS, Normandie Univ., Rouen, France
fYear
2014
fDate
2-5 Nov. 2014
Firstpage
32
Lastpage
38
Abstract
With the advent of NGS technologies there are more and more genomic sequences of individuals of the same species available. These sequences only differ by a very small amount. There is thus a strong need for efficient algorithms for performing fast pattern matching in such specific sets of sequences. In this paper we propose a very efficient algorithm that solves the on-line exact pattern matching problem in a set of highly similar DNA sequences. The algorithm we propose extends variants of the Boyer-Moore exact string matching algorithm. Experimental results show that our new algorithm exhibits the best performances in practice.
Keywords
DNA; bioinformatics; genomics; molecular biophysics; molecular configurations; string matching; Boyer-Moore exact string matching algorithm; NGS technologies; fast pattern matching algorithm; genomic sequences; highly similar DNA sequences; on-line exact pattern matching problem; Arrays; DNA; Educational institutions; Genomics; Hamming distance; Pattern matching;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
Conference_Location
Belfast
Type
conf
DOI
10.1109/BIBM.2014.6999384
Filename
6999384
Link To Document