Title :
Modeling the Marginal Distribution of Gene Expression with Mixture Models
Author :
Wijaya, Edward ; Harada, Hajime ; Horton, Paul
Author_Institution :
Comput. Biol. Res. Center, AIST Waterfront, Tokyo, Japan
Abstract :
We report the results of fitting mixture models to the distribution of expression values for individual genes over a broad range of normal tissues, which we call the marginal distribution of the gene. The base distributions used were normal, lognormal and gamma. The expectation-maximization algorithm was used to learn the model parameters. Experiments with artificial data were performed to ascertain the robustness of learning. Applying the procedure to data from two publicly available microarray datasets, we conclude that lognormal performed the best function for modeling the marginal distributions of gene expression. Our results should provide guidances in the development of informed priors or gene specific normalization for use with gene network inference algorithms.
Keywords :
biological tissues; biology computing; expectation-maximisation algorithm; gamma distribution; genetics; inference mechanisms; log normal distribution; normal distribution; artificial data; base distributions; expectation-maximization algorithm; fitting mixture models; gamma distribution; gene expression data; gene expression marginal distribution modeling; gene network inference algorithms; gene specific normalization; lognormal distribution; normal distribution; normal tissues; Biological system modeling; Biological tissues; Computational biology; Density functional theory; Gene expression; Iterative algorithms; Parameter estimation; Probes; Robustness; Shape; marginal distribution; microarray; mixture models;
Conference_Titel :
Future Generation Communication and Networking, 2008. FGCN '08. Second International Conference on
Conference_Location :
Hainan Island
Print_ISBN :
978-0-7695-3431-2
DOI :
10.1109/FGCN.2008.75