• DocumentCode
    3410540
  • Title

    A hidden Markov model for gene function prediction from sequential expression data

  • Author

    Deng, Xutao ; Ali, Hesham

  • Author_Institution
    Coll. of Inf. Sci. & Technol., Nebraska Univ., Omaha, NE, USA
  • fYear
    2004
  • fDate
    16-19 Aug. 2004
  • Firstpage
    670
  • Lastpage
    671
  • Abstract
    Hidden Markov models (HMMs) have demonstrated great successes in modeling noisy sequential data sets in the area of speech recognition and protein sequence profiling. Results from association test showed significant Markov dependency in time-series gene expression data, and therefore HMMs would be especially appropriate for modeling gene expressions. In this project, we developed a gene function prediction tool based on profile HMMs. Each function class is associated with a distinct HMM whose parameters are trained using yeast time-series gene expression data. The function annotations of the HMM training set were obtained from Munich Information Centre for Protein Sequences (MIPS) data base. We designed several structural variants of HMMs (single, double-split) and tested each of them on forty function classes each of which includes more than one hundred instances. The highest prediction sensitivity we achieved is 51% by using double-split HMM with 3-fold cross-validation.
  • Keywords
    biology computing; genetics; hidden Markov models; molecular biophysics; prediction theory; proteins; time series; Munich Information Centre for Protein Sequences data base; function annotations; gene function prediction; hidden Markov model; sequential expression data; yeast time-series gene expression data; Accuracy; Bayesian methods; Bioinformatics; Educational institutions; Fungi; Gene expression; Genomics; Hidden Markov models; Information science; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
  • Print_ISBN
    0-7695-2194-0
  • Type

    conf

  • DOI
    10.1109/CSB.2004.1332541
  • Filename
    1332541