• DocumentCode
    2341496
  • Title

    A maximum entropy algorithm for rhythmic analysis of genome-wide expression patterns

  • Author

    Langmead, Christopher James ; McClung, C. Robertson ; Donald, Bruce Randall

  • Author_Institution
    Dept. of Comput. Sci., Dartmouth Coll., Hanover, NH, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    237
  • Lastpage
    245
  • Abstract
    We introduce a maximum entropy-based analysis technique for extracting and characterizing rhythmic expression profiles from DNA microarray hybridization data. These patterns are clues to discovering genes implicated in cell-cycle, circadian, and other periodic biological processes. The algorithm, implemented in a program called ENRAGE (Entropy-based Rhythmic Analysis of Gene Expression), treats the task of estimating an expression profile´s periodicity and phase as a simultaneous bicriterion optimization problem. Specifically, a frequency domain spectrum is reconstructed from a time-series of gene expression data, subject to two constraints: (a) the likelihood of the spectrum and (b) the Shannon entropy of the reconstructed spectrum. Unlike Fourier-based spectral analysis, maximum entropy spectral reconstruction is well suited to signals of the type generated in DNA microarray experiments. Our algorithm is optimal, running in linear time in the number of expression profiles. Moreover an implementation of our algorithm runs an order of magnitude faster than previous methods. Finally, we demonstrate that ENRAGE is superior to other methods at identifying and characterizing periodic expression profiles on both synthetic and actual DNA microarray hybridization data.
  • Keywords
    DNA; biology computing; genetics; maximum entropy methods; optimisation; pattern matching; time series; DNA microarray hybridization data; ENRAGE; Entropy-based Rhythmic Analysis of Gene Expression; Shannon entropy; bicriterion optimization; biological processes; frequency domain spectrum; genome-wide expression patterns; maximum entropy algorithm; pattern matching; rhythmic analysis; rhythmic expression profiles; time-series; Algorithm design and analysis; Bioinformatics; Biological processes; DNA; Data mining; Entropy; Gene expression; Genomics; Pattern analysis; Phase estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics Conference, 2002. Proceedings. IEEE Computer Society
  • Print_ISBN
    0-7695-1653-X
  • Type

    conf

  • DOI
    10.1109/CSB.2002.1039346
  • Filename
    1039346