Title :
A time efficient algorithm for finding longest common subsequence from two molecular sequences
Author :
Rizvi, S.A.M. ; Agarwal, Pankaj
Author_Institution :
Jamia Millia Islamia Univ., New Delhi
Abstract :
This paper presents a new algorithm for finding longest common subsequence from two given sequences of DNA, proteins etc of interest. We have presented a sequential algorithm, which is quite effective. Our method compares the given database sequence of DNA representing the entire/partial genetic structure of some organism (such sequences are very long), which is assumed to be stored in the database only once with a given query sequence. Although our method can be applied for protein or RNA Sequences but here we have only taken DNA Sequences. The bucket sorting method where elements are collected in buckets and then collected in some manner to get the sorted list inspires our algorithm. Our algorithm collects the index positions of the database sequence in respective buckets which becomes the input for the main algorithm. Given algorithm is quite effective as far as time complexity is concerned.
Keywords :
DNA; biology computing; computational complexity; genetics; molecular biophysics; query processing; sequences; sorting; DNA sequences; bucket sorting method; database sequence; longest common subsequence; molecular sequences; partial genetic structure; query sequence; time complexity; Computer science; DNA; Databases; Dynamic programming; Genetics; Organisms; Parallel algorithms; Protein engineering; RNA; Sequences;
Conference_Titel :
Bioengineering Conference, 2007. NEBC '07. IEEE 33rd Annual Northeast
Conference_Location :
Long Island, NY
Print_ISBN :
978-1-4244-1033-0
Electronic_ISBN :
978-1-4244-1033-0
DOI :
10.1109/NEBC.2007.4413398