Title :
A Graph Based Approach to Discover Conserved Regions in DNA and Protein Sequences
Author :
Challa, Santan ; Thulasiraman, Parimala
Author_Institution :
Dept. of Comput. Sci., Manitoba Univ., Winnipeg, MB
Abstract :
This paper attempts to provide a graph based approach to discover conserved regions such as motifs in either DNA or Protein sequences. The motif discovery problem has gained lot of significance in biological science over the past decade. Lately various approaches have been used successfully to discover motifs. Some of them are based on probabilistic approach and the others on a combinatorial approach. We have followed a graph-based combinatorial approach to solve this problem, in particular, using the idea of de Bruijn graphs. The de Bruijn graph has been successfully adopted to solve problems such as local alignment and DNA fragment assembly. Our method harnesses the power of the de Bruijn graph to discover the conserved regions in a DNA or protein sequence. We have found that the algorithm was successful in mining signals for larger number of sequences and at a faster rate when compared to some popular motif searching tools.
Keywords :
DNA; biology computing; graph theory; molecular biophysics; proteins; sequences; DNA sequence; biological science; combinatorial approach; conserved region discovery; de Bruijn graph; motif discovery; probabilistic approach; protein sequence; Assembly; Bioinformatics; Biology; Computer science; Cryptography; DNA; Diseases; Evolution (biology); Genomics; Protein sequence;
Conference_Titel :
Advanced Information Networking and Applications Workshops, 2007, AINAW '07. 21st International Conference on
Conference_Location :
Niagara Falls, Ont.
Print_ISBN :
978-0-7695-2847-2
DOI :
10.1109/AINAW.2007.24