DocumentCode
1147410
Title
A uniform projection method for motif discovery in DNA sequences
Author
Raphael, Benjamin; Liu, Lung-Tien; Varghese, George
Volume
1
Issue
2
fYear
2004
Firstpage
91
Lastpage
94
Abstract
Buhler and Tompa (2002) introduced the random projection algorithm for the motif discovery problem and demonstrated that this algorithm performs well on both simulated and biological samples. We describe a modification of the random projection algorithm, called the uniform projection algorithm, which utilizes a different choice of projections. We replace the random selection of projections by a greedy heuristic that approximately equalizes the coverage of the projections. We show that this change in selection of projections leads to improved performance on motif discovery problems. Furthermore, the uniform projection algorithm is directly applicable to other problems where the random projection algorithm has been used, including comparison of protein sequence databases.
Keywords
DNA; biology computing; molecular biophysics; DNA sequences; greedy heuristic; motif discovery; protein sequence databases; uniform projection method; Biological system modeling; Computational biology; Databases; Gene expression; Genetic mutations; Optimization methods; Pattern matching; Projection algorithms; Protein sequence; combinatorial designs; low-discrepancy sequences; random projection; transcription factor binding sites; Algorithms; Binding Sites; Monte Carlo Method; Mutation; Regulatory Elements, Transcriptional; Transcription Factors
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
IEEE
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2004.14
Filename
1350751