• DocumentCode
    464225
  • Title

    A Graph Based Approach to Discover Conserved Regions in DNA and Protein Sequences

  • Author

    Challa, Santan ; Thulasiraman, Parimala

  • Author_Institution
    Dept. of Comput. Sci., Manitoba Univ., Winnipeg, MB
  • Volume
    1
  • fYear
    2007
  • fDate
    21-23 May 2007
  • Firstpage
    672
  • Lastpage
    677
  • Abstract
    This paper attempts to provide a graph based approach to discover conserved regions such as motifs in either DNA or Protein sequences. The motif discovery problem has gained lot of significance in biological science over the past decade. Lately various approaches have been used successfully to discover motifs. Some of them are based on probabilistic approach and the others on a combinatorial approach. We have followed a graph-based combinatorial approach to solve this problem, in particular, using the idea of de Bruijn graphs. The de Bruijn graph has been successfully adopted to solve problems such as local alignment and DNA fragment assembly. Our method harnesses the power of the de Bruijn graph to discover the conserved regions in a DNA or protein sequence. We have found that the algorithm was successful in mining signals for larger number of sequences and at a faster rate when compared to some popular motif searching tools.
  • Keywords
    DNA; biology computing; graph theory; molecular biophysics; proteins; sequences; DNA sequence; biological science; combinatorial approach; conserved region discovery; de Bruijn graph; motif discovery; probabilistic approach; protein sequence; Assembly; Bioinformatics; Biology; Computer science; Cryptography; DNA; Diseases; Evolution (biology); Genomics; Protein sequence;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications Workshops, 2007, AINAW '07. 21st International Conference on
  • Conference_Location
    Niagara Falls, Ont.
  • Print_ISBN
    978-0-7695-2847-2
  • Type

    conf

  • DOI
    10.1109/AINAW.2007.24
  • Filename
    4221135