• DocumentCode
    3540134
  • Title

    A guided dynamic programming approach for searching a set of similar DNA sequences

  • Author

    Nordin, A. R M ; Yazid, M. S M ; Aziz, A. ; Osman, M. T A

  • Author_Institution
    Fac. of Inf., Univ. Darul Iman Malaysia, Kuala Terengganu, Malaysia
  • fYear
    2009
  • fDate
    4-6 Aug. 2009
  • Firstpage
    512
  • Lastpage
    517
  • Abstract
    Optimal estimation of similarity distance between DNA sequences is performed through alignment process. This optimal alignment process is done by using dynamic programming method which running in quadratic O(ntimesm) time complexity. Filtering process is a common technique introduced to improve this optimal alignment process. A filtering process applied in heuristic tools such as BLAST and FASTA consists of scanning the exact matches of subsequences in query sequence to the sequences in database. The main purpose of filtering is to discard the irrelevant subsequences from being performed for rigorous optimal alignment process. Differently, this paper addresses the technique of filtering the expected irrelevant sequences in database from being executed for rigorous optimal alignment process. An automaton-based algorithm is used to develop the filtering process proposed. A set of random patterns is generated from query sequence will placed in automaton machine before exact matching and scoring process is performed. Extensive experiments have been carried out on several parameters and the results show that the developed filtering technique removes the unrelated targeted sequences from being aligned with query sequence.
  • Keywords
    DNA; biology computing; computational complexity; database management systems; dynamic programming; information filtering; pattern matching; query formulation; BLAST; FASTA; alignment process; automaton-based algorithm; database; filtering process; guided dynamic programming approach; heuristic tools; matching process; optimal estimation; quadratic O(ntimesm) time complexity; query sequence; scoring process; similar DNA sequences searching; similarity distance; Automata; Communications technology; DNA; Databases; Dynamic programming; Evolution (biology); Filtering; Informatics; Pattern matching; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Digital Information and Web Technologies, 2009. ICADIWT '09. Second International Conference on the
  • Conference_Location
    London
  • Print_ISBN
    978-1-4244-4456-4
  • Electronic_ISBN
    978-1-4244-4457-1
  • Type

    conf

  • DOI
    10.1109/ICADIWT.2009.5273967
  • Filename
    5273967