• DocumentCode
    2842354
  • Title

    An Efficient Combinatorial Approach for Solving the DNA Motif Finding Problem

  • Author

    Geraci, Filippo ; Pellegrini, Marco ; Renda, M. Elena

  • Author_Institution
    Ist. di Inf. e Telematica, Consiglio Naz. delle Ric., Pisa, Italy
  • fYear
    2009
  • fDate
    Nov. 30 2009-Dec. 2 2009
  • Firstpage
    335
  • Lastpage
    340
  • Abstract
    The detection of an over-represented sub-sequence in a set of (carefully chosen) DNA sequences is often the main clue leading to the investigation of a possible functional role for such a subsequence. Over-represented substrings (with possibly local mutations) in a biological string are termed motifs. A typical functional unit that can be modeled by a motif is a Transcription Factor Binding Site (TFBS), a portion of the DNA sequence apt to the binding of a protein that participates in complex transcriptomic biochemical reactions. In the literature it has been proposed a simplified combinatorial problem called the planted (l-d)-motif problem (known also as the (l-d) Challenge Problem) that captures the essential combinatorial nature of the motif finding problem. In this paper we propose a novel graph-based algorithm for solving a refinement of the (l-d) Challenge Problem. Experimental results show that instances of the (l-d) Challenge Problem considered difficult for competing state of the art methods in literature can be solved efficiently in our framework.
  • Keywords
    DNA; biology computing; genetics; graph theory; (l-d) challenge problem; DNA motif finding problem; DNA sequences; biological string; combinatorial approach; combinatorial problem; complex transcriptomic biochemical reactions; graph-based algorithm; over-represented sub-sequence; over-represented substrings; planted (l-d)-motif problem; transcription factor binding site; Algorithm design and analysis; Biological system modeling; DNA; Diseases; Evolution (biology); Genetic mutations; Intelligent systems; Machinery; Proteins; Sequences; (l-d) Challenge Problem; TFBS detection; graph-based algorithm; motif finding problem;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications, 2009. ISDA '09. Ninth International Conference on
  • Conference_Location
    Pisa
  • Print_ISBN
    978-1-4244-4735-0
  • Electronic_ISBN
    978-0-7695-3872-3
  • Type

    conf

  • DOI
    10.1109/ISDA.2009.41
  • Filename
    5364851