DocumentCode :
454971
Title :
Handling Updates of a Pairwise Sequence Alignment
Author :
Hong, Changjin ; Tewfik, Ahmed H.
Author_Institution :
Dept. of Electr. & Comput. Eng., Minnesota Univ.
Volume :
2
fYear :
2006
fDate :
14-19 May 2006
Abstract :
Sequence alignment or decoding in molecular biology is mostly done via computationally expensive dynamic programming (DP) based approaches. Unfortunately, as sequencing errors are discovered frequently, researchers must repeat all previous similarity analysis for the erroneous sequence. This can take hours or days. In this work, we derive relative tolerance bounds on node distances from a root node that guarantee that partial shortest path distances remain optimal. We then propose an algorithm that uses these bounds to skip all unperturbed parts of a sequence when recomputing an alignment. We also discuss techniques to reduce the memory requirements of the algorithm by focusing on the highly conserved segments of the sequence. Experimental results establish that our proposed alignment procedure can update alignment decisions of modified sequence with 4.6% to 18% of the number of computations required by the normal Needleman-Wunsch algorithm, depending on sequence length. Higher computational savings are achieved with longer sequences
Keywords :
decoding; dynamic programming; molecular biophysics; molecular orientation; Needleman-Wunsch algorithm; decoding; dynamic programming; molecular biology; pairwise sequence alignment; sequencing errors; Biological information theory; Biology computing; Collaboration; Computer errors; Databases; Decoding; Dynamic programming; Polymers; Sequences; Tree graphs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660540
Filename :
1660540
Link To Document :
بازگشت