Title :
Estimating Genome-Wide Gene Networks Using Nonparametric Bayesian Network Models on Massively Parallel Computers
Author :
Tamada, Yoshinori ; Imoto, Seiya ; Araki, Hiromitsu ; Nagasaki, Masao ; Print, Cristin ; Charnock-Jones, D. Stephen ; Miyano, Satoru
Author_Institution :
Human Genome Center, Univ. of Tokyo, Tokyo, Japan
Abstract :
We present a novel algorithm to estimate genome-wide gene networks consisting of more than 20,000 genes from gene expression data using nonparametric Bayesian networks. Due to the difficulty of learning Bayesian network structures, existing algorithms cannot be applied to more than a few thousand genes. Our algorithm overcomes this limitation by repeatedly estimating subnetworks in parallel for genes selected by neighbor node sampling. Through numerical simulation, we confirmed that our algorithm outperformed a heuristic algorithm in a shorter time. We applied our algorithm to microarray data from human umbilical vein endothelial cells (HUVECs) treated with siRNAs, to construct a human genome-wide gene network, which we compared to a small gene network estimated for the genes extracted using a traditional bioinformatics method. The results showed that our genome-wide gene network contains many features of the small network, as well as others that could not be captured during the small network estimation. The results also revealed master-regulator genes that are not in the small network but that control many of the genes in the small network. These analyses were impossible to realize without our proposed algorithm.
Keywords :
belief networks; biochemistry; bioinformatics; cellular biophysics; data analysis; genetics; genomics; molecular biophysics; bioinformatics; genome-wide gene network; human umbilical vein endothelial cells; microarray data; nonparametric Bayesian network; parallel computers; siRNA; Algorithm design and analysis; Artificial neural networks; Bayesian methods; Bioinformatics; Computers; Estimation; Genomics; Bayesian network structure learning; Biology and genetics; gene expression data analysis; gene networks; Algorithms; Bayes Theorem; Computational Biology; Computer Simulation; Databases, Genetic; Endothelial Cells; Gene Expression Profiling; Gene Regulatory Networks; Genome; Humans; Models, Genetic; Oligonucleotide Array Sequence Analysis; RNA, Small Interfering; Statistics, Nonparametric
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2010.68