DocumentCode :
3406242
Title :
Modeling regulatory sites with higher order position-dependent weight matrices
Author :
Zare, Hossein ; Kaveh, Mostafa ; Khodursky, Arkady B.
Author_Institution :
Dept. of ECE, Minnesota Univ., Minneapolis, MN
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
629
Lastpage :
632
Abstract :
Identification of regulatory signals in DNA depends on the nature and quality of the patterns of representative sequences. These patterns are constructed from training sets of sequences by means of probabilistic models that either assume independence between positions or that suffer from considerable computational complexity. We have developed and tested higher order models that account for significant dependent position pairs or triads, thereby capturing position-dependent information hidden in DNA binding sites. We have evaluated our algorithm on several data sets, including eukaryotic and bacterial transcription factor binding sites and shown that the scores from the higher order representation of binding sites have significant positive correlation to the binding affinity scores.
Keywords :
DNA; biology computing; computational complexity; computational complexity; eukaryotic; higher order position-dependent weight matrices; higher order representation; probabilistic models; regulatory signal identification; regulatory sites; representative sequences; transcription factor binding sites; Bioinformatics; Computational complexity; DNA; Genomics; Microorganisms; Proteins; Pulse width modulation; Sequences; Signal processing; Testing; DNA Binding sites; Position weight matrix; Regulatory signal; Transcription factor;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4517688
Filename :
4517688
Link To Document :
بازگشت