Title :
A guided dynamic programming approach for searching a set of similar DNA sequences
Author :
Nordin, A. R M ; Yazid, M. S M ; Aziz, A. ; Osman, M. T A
Author_Institution :
Fac. of Inf., Univ. Darul Iman Malaysia, Kuala Terengganu, Malaysia
Abstract :
Optimal estimation of similarity distance between DNA sequences is performed through alignment process. This optimal alignment process is done by using dynamic programming method which running in quadratic O(ntimesm) time complexity. Filtering process is a common technique introduced to improve this optimal alignment process. A filtering process applied in heuristic tools such as BLAST and FASTA consists of scanning the exact matches of subsequences in query sequence to the sequences in database. The main purpose of filtering is to discard the irrelevant subsequences from being performed for rigorous optimal alignment process. Differently, this paper addresses the technique of filtering the expected irrelevant sequences in database from being executed for rigorous optimal alignment process. An automaton-based algorithm is used to develop the filtering process proposed. A set of random patterns is generated from query sequence will placed in automaton machine before exact matching and scoring process is performed. Extensive experiments have been carried out on several parameters and the results show that the developed filtering technique removes the unrelated targeted sequences from being aligned with query sequence.
Keywords :
DNA; biology computing; computational complexity; database management systems; dynamic programming; information filtering; pattern matching; query formulation; BLAST; FASTA; alignment process; automaton-based algorithm; database; filtering process; guided dynamic programming approach; heuristic tools; matching process; optimal estimation; quadratic O(ntimesm) time complexity; query sequence; scoring process; similar DNA sequences searching; similarity distance; Automata; Communications technology; DNA; Databases; Dynamic programming; Evolution (biology); Filtering; Informatics; Pattern matching; Sequences;
Conference_Titel :
Applications of Digital Information and Web Technologies, 2009. ICADIWT '09. Second International Conference on the
Conference_Location :
London
Print_ISBN :
978-1-4244-4456-4
Electronic_ISBN :
978-1-4244-4457-1
DOI :
10.1109/ICADIWT.2009.5273967