DocumentCode
3410540
Title
A hidden Markov model for gene function prediction from sequential expression data
Author
Deng, Xutao ; Ali, Hesham
Author_Institution
Coll. of Inf. Sci. & Technol., Nebraska Univ., Omaha, NE, USA
fYear
2004
fDate
16-19 Aug. 2004
Firstpage
670
Lastpage
671
Abstract
Hidden Markov models (HMMs) have demonstrated great successes in modeling noisy sequential data sets in the area of speech recognition and protein sequence profiling. Results from association test showed significant Markov dependency in time-series gene expression data, and therefore HMMs would be especially appropriate for modeling gene expressions. In this project, we developed a gene function prediction tool based on profile HMMs. Each function class is associated with a distinct HMM whose parameters are trained using yeast time-series gene expression data. The function annotations of the HMM training set were obtained from Munich Information Centre for Protein Sequences (MIPS) data base. We designed several structural variants of HMMs (single, double-split) and tested each of them on forty function classes each of which includes more than one hundred instances. The highest prediction sensitivity we achieved is 51% by using double-split HMM with 3-fold cross-validation.
Keywords
biology computing; genetics; hidden Markov models; molecular biophysics; prediction theory; proteins; time series; Munich Information Centre for Protein Sequences data base; function annotations; gene function prediction; hidden Markov model; sequential expression data; yeast time-series gene expression data; Accuracy; Bayesian methods; Bioinformatics; Educational institutions; Fungi; Gene expression; Genomics; Hidden Markov models; Information science; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN
0-7695-2194-0
Type
conf
DOI
10.1109/CSB.2004.1332541
Filename
1332541
Link To Document