DocumentCode :
2341496
Title :
A maximum entropy algorithm for rhythmic analysis of genome-wide expression patterns
Author :
Langmead, Christopher James ; McClung, C. Robertson ; Donald, Bruce Randall
Author_Institution :
Dept. of Comput. Sci., Dartmouth Coll., Hanover, NH, USA
fYear :
2002
fDate :
2002
Firstpage :
237
Lastpage :
245
Abstract :
We introduce a maximum entropy-based analysis technique for extracting and characterizing rhythmic expression profiles from DNA microarray hybridization data. These patterns are clues to discovering genes implicated in cell-cycle, circadian, and other periodic biological processes. The algorithm, implemented in a program called ENRAGE (Entropy-based Rhythmic Analysis of Gene Expression), treats the task of estimating an expression profile´s periodicity and phase as a simultaneous bicriterion optimization problem. Specifically, a frequency domain spectrum is reconstructed from a time-series of gene expression data, subject to two constraints: (a) the likelihood of the spectrum and (b) the Shannon entropy of the reconstructed spectrum. Unlike Fourier-based spectral analysis, maximum entropy spectral reconstruction is well suited to signals of the type generated in DNA microarray experiments. Our algorithm is optimal, running in linear time in the number of expression profiles. Moreover an implementation of our algorithm runs an order of magnitude faster than previous methods. Finally, we demonstrate that ENRAGE is superior to other methods at identifying and characterizing periodic expression profiles on both synthetic and actual DNA microarray hybridization data.
Keywords :
DNA; biology computing; genetics; maximum entropy methods; optimisation; pattern matching; time series; DNA microarray hybridization data; ENRAGE; Entropy-based Rhythmic Analysis of Gene Expression; Shannon entropy; bicriterion optimization; biological processes; frequency domain spectrum; genome-wide expression patterns; maximum entropy algorithm; pattern matching; rhythmic analysis; rhythmic expression profiles; time-series; Algorithm design and analysis; Bioinformatics; Biological processes; DNA; Data mining; Entropy; Gene expression; Genomics; Pattern analysis; Phase estimation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics Conference, 2002. Proceedings. IEEE Computer Society
Print_ISBN :
0-7695-1653-X
Type :
conf
DOI :
10.1109/CSB.2002.1039346
Filename :
1039346
Link To Document :
بازگشت