• DocumentCode
    1147410
  • Title

    A uniform projection method for motif discovery in DNA sequences

  • Author

    Raphael, Benjamin ; Liu, Lung-Tien ; Varghese, George

  • Volume
    1
  • Issue
    2
  • fYear
    2004
  • Firstpage
    91
  • Lastpage
    94
  • Abstract
    Buhler and Tompa (2002) introduced the random projection algorithm for the motif discovery problem and demonstrated that this algorithm performs well on both simulated and biological samples. We describe a modification of the random projection algorithm, called the uniform projection algorithm, which utilizes a different choice of projections. We replace the random selection of projections by a greedy heuristic that approximately equalizes the coverage of the projections. We show that this change in selection of projections leads to improved performance on motif discovery problems. Furthermore, the uniform projection algorithm is directly applicable to other problems where the random projection algorithm has been used, including comparison of protein sequence databases.
  • Keywords
    DNA; biology computing; molecular biophysics; DNA sequences; greedy heuristic; motif discovery; protein sequence databases; uniform projection method; Biological system modeling; Computational biology; Databases; Gene expression; Genetic mutations; Optimization methods; Pattern matching; Projection algorithms; Protein sequence; combinatorial designs; low-discrepancy sequences; random projection; transcription factor binding sites; Algorithms; Binding Sites; Monte Carlo Method; Mutation; Regulatory Elements, Transcriptional; Transcription Factors
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    IEEE
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2004.14
  • Filename
    1350751