Title : 
Multiple Sequence Alignment Containing a Sequence of Regular Expressions
         
        
            Author : 
Arslan, Abdullah N.
         
        
            Author_Institution : 
Department of Computer Science The University of Vermont Burlington, VT 05405, USA, aarslan@cs.uvm.edu
         
        
        
        
        
        
            Abstract : 
A classical algorithm for the pairwise sequence alignment is the Smith Waterman algorithm which uses dynamic programming. The algorithm computes the maximum score of alignments that use insertions, deletions, and substitutions, with no consideration given in composition of the alignments. However, biologists favor applying their knowledge about common structures or functions into the alignment process. For alignment of protein sequences, several methods have been suggested for taking into account the motifs (a restricted regular expression) from the PROSITE database to guide alignments. One method modifies the Smith Waterman dynamic programming solution to reward alignments that contain matching motifs. Another method introduces the regular expression constrained sequence alignment problem in which pairwise alignments are constrained to contain a given regular expression. This latter method constructs a weighted finite automaton from a given regular expression, and presents a dynamic programming solution that simulates copies of this automaton in seeking an alignment with maximum score containing the regular expression. We generalize this approach: 1) We introduce a variation of the problem for multiple sequences, namely the regular expression constrained multiple sequence alignment, and present an algorithm for it; 2) We develop an algorithm for the case of the problem when the alignments sought are required to contain a given sequence of regular expressions.
         
        
            Keywords : 
Automata; Biology computing; Computer science; Databases; Degradation; Dynamic programming; Heuristic algorithms; Proteins; RNA; Writing;
         
        
        
        
            Conference_Titel : 
Computational Intelligence in Bioinformatics and Computational Biology, 2005. CIBCB '05. Proceedings of the 2005 IEEE Symposium on
         
        
            Print_ISBN : 
0-7803-9387-2
         
        
        
            DOI : 
10.1109/CIBCB.2005.1594922