DocumentCode :
1191208
Title :
Heuristic Reusable Dynamic Programming: Efficient Updates of Local Sequence Alignment
Author :
Hong, Changjin ; Tewfik, Ahmed H.
Author_Institution :
Dept. of Electr. Eng., Univ. of Minnesota, Minneapolis, MN, USA
Volume :
6
Issue :
4
fYear :
2009
Firstpage :
570
Lastpage :
582
Abstract :
Recomputation of the previously evaluated similarity results between biological sequences becomes inevitable when researchers realize errors in their sequenced data or when the researchers have to compare nearly similar sequences, e.g., in a family of proteins. We present an efficient scheme for updating local sequence alignments with an affine gap model. In principle, using the previous matching result between two amino acid sequences, we perform a forward-backward alignment to generate heuristic searching bands which are bounded by a set of suboptimal paths. Given a correctly updated sequence, we initially predict a new score of the alignment path for each contour to select the best candidates among them. Then, we run the Smith-Waterman algorithm in this confined space. Furthermore, our heuristic alignment for an updated sequence shows that it can be further accelerated by using reusable dynamic programming (rDP), our prior work. In this study, we successfully validate "relative node tolerance boundrdquo (RNTB) in the pruned searching space. Furthermore, we improve the computational performance by quantifying the successful RNTB tolerance probability and switch to rDP on perturbation-resilient columns only. In our searching space derived by a threshold value of 90 percent of the optimal alignment score, we find that 98.3 percent of contours contain correctly updated paths. We also find that our method consumes only 25.36 percent of the runtime cost of sparse dynamic programming (sDP) method, and to only 2.55 percent of that of a normal dynamic programming with the Smith-Waterman algorithm.
Keywords :
biology computing; dynamic programming; heuristic programming; proteins; Smith-Waterman algorithm; affine gap model; amino acid sequence; forward-backward alignment; heuristic reusable dynamic programming; local sequence alignment; relative node tolerance bound; sparse dynamic programming; Shortest path; dynamic programming; minimum spanning tree; sensitivity analysis; sequence alignment; string edit; suboptimal paths.; Algorithms; Artificial Intelligence; Computational Biology; Computer Graphics; Humans; Pattern Recognition, Automated; Programming Languages; Proteins; Reproducibility of Results; Sequence Alignment; Sequence Analysis, Protein; Signal Processing, Computer-Assisted; Software;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2009.30
Filename :
4799770
Link To Document :
بازگشت