DocumentCode :
3204768
Title :
A fast algorithm on average for all-against-all sequence matching
Author :
Baeza-Yates, Ricardo A. ; Gonnet, Gaston H.
Author_Institution :
Dept. de Ciencias de la Comput., Chile Univ., Santiago, Chile
fYear :
1999
fDate :
1999
Firstpage :
16
Lastpage :
23
Abstract :
We present an algorithm which attempts to align pairs of subsequences from a database of genetic sequences. The algorithm simulates the classical dynamic programming alignment algorithm over a suffix array of the database. We provide a detailed average case analysis which shows that the running time of the algorithm is subquadratic with respect to the database size. A similar algorithm solves the approximate string matching problem in sublinear average time
Keywords :
biology computing; computational complexity; database theory; dynamic programming; molecular biophysics; string matching; DNA; all-against-all sequence matching; approximate string matching problem; average case analysis; dynamic programming alignment algorithm; fast algorithm; genetic sequences; peptide sequence database; running time; suffix array; Algorithm design and analysis; Computational modeling; DNA; Databases; Dynamic programming; Genetics; Heuristic algorithms; Impedance matching; Peptides; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
String Processing and Information Retrieval Symposium, 1999 and International Workshop on Groupware
Conference_Location :
Cancun
Print_ISBN :
0-7695-0268-7
Type :
conf
DOI :
10.1109/SPIRE.1999.796573
Filename :
796573
Link To Document :
بازگشت