DocumentCode :
80827
Title :
A Local Poisson Graphical Model for Inferring Networks From Sequencing Data
Author :
Allen, Genevera I. ; Zhandong Liu
Author_Institution :
Dept. of Stat. & Electr. & Comput. Eng., Rice Univ., Houston, TX, USA
Volume :
12
Issue :
3
fYear :
2013
fDate :
Sept. 2013
Firstpage :
189
Lastpage :
198
Abstract :
Gaussian graphical models, a class of undirected graphs or Markov Networks, are often used to infer gene networks based on microarray expression data. Many scientists, however, have begun using high-throughput sequencing technologies such as RNA-sequencing or next generation sequencing to measure gene expression. As the resulting data consists of counts of sequencing reads for each gene, Gaussian graphical models are not optimal for this discrete data. In this paper, we propose a novel method for inferring gene networks from sequencing data: the Local Poisson Graphical Model. Our model assumes a Local Markov property where each variable conditional on all other variables is Poisson distributed. We develop a neighborhood selection algorithm to fit our model locally by performing a series of l1 penalized Poisson, or log-linear, regressions. This yields a fast parallel algorithm for estimating networks from next generation sequencing data. In simulations, we illustrate the effectiveness of our methods for recovering network structure from count data. A case study on breast cancer microRNAs (miRNAs), a novel application of graphical models, finds known regulators of breast cancer genes and discovers novel miRNA clusters and hubs that are targets for future research.
Keywords :
Markov processes; Poisson distribution; RNA; bioinformatics; biological organs; cancer; data structures; genetics; genomics; molecular biophysics; molecular configurations; parallel algorithms; physiological models; regression analysis; Gaussian graphical model; Markov network; Poisson distribution; RNA next generation sequencing; breast cancer gene regulator; breast cancer microRNA; data network structure; data recovery; data sequencing; gene expression measurement; gene network inferring; high-throughput sequencing technology; l1 penalized Poisson series; l1 penalized Poisson series; local Markov property; local poisson graphical model; log-linear analysis; microarray data expression; neighborhood selection algorithm; parallel algorithm; regression analysis; undirected graph; Bioinformatics; Genomics; Graphical models; Markov random fields; Sequential analysis; Stability analysis; Gene regulatory networks; Markov networks; microRNAs; next generation sequencing data; undirected graphical models; Breast Neoplasms; Computational Biology; Computer Simulation; Female; Gene Regulatory Networks; Humans; Markov Chains; MicroRNAs; Models, Genetic; Poisson Distribution; Sequence Analysis, DNA;
fLanguage :
English
Journal_Title :
NanoBioscience, IEEE Transactions on
Publisher :
ieee
ISSN :
1536-1241
Type :
jour
DOI :
10.1109/TNB.2013.2263838
Filename :
6578145
Link To Document :
بازگشت