Title :
Modeling regulatory sites with higher order position-dependent weight matrices
Author :
Zare, Hossein ; Kaveh, Mostafa ; Khodursky, Arkady B.
Author_Institution :
Dept. of ECE, Minnesota Univ., Minneapolis, MN
fDate :
March 31 2008-April 4 2008
Abstract :
Identification of regulatory signals in DNA depends on the nature and quality of the patterns of representative sequences. These patterns are constructed from training sets of sequences by means of probabilistic models that either assume independence between positions or that suffer from considerable computational complexity. We have developed and tested higher order models that account for significant dependent position pairs or triads, thereby capturing position-dependent information hidden in DNA binding sites. We have evaluated our algorithm on several data sets, including eukaryotic and bacterial transcription factor binding sites and shown that the scores from the higher order representation of binding sites have significant positive correlation to the binding affinity scores.
Keywords :
DNA; biology computing; computational complexity; computational complexity; eukaryotic; higher order position-dependent weight matrices; higher order representation; probabilistic models; regulatory signal identification; regulatory sites; representative sequences; transcription factor binding sites; Bioinformatics; Computational complexity; DNA; Genomics; Microorganisms; Proteins; Pulse width modulation; Sequences; Signal processing; Testing; DNA Binding sites; Position weight matrix; Regulatory signal; Transcription factor;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517688