DocumentCode
22965
Title
Capturing Uncertainty by Modeling Local Transposon Insertion Frequencies Improves Discrimination of Essential Genes
Author
DeJesus, Michael A. ; Ioerger, Thomas R.
Author_Institution
Dept. of Comput. Sci. & Eng., Texas A&M Univ., College Station, TX, USA
Volume
12
Issue
1
fYear
2015
fDate
Jan.-Feb. 1 2015
Firstpage
92
Lastpage
102
Abstract
Transposon mutagenesis experiments enable the identification of essential genes in bacteria. Deep-sequencing of mutant libraries provides a large amount of high-resolution data on essentiality. Statistical methods developed to analyze this data have traditionally assumed that the probability of observing a transposon insertion is the same across the genome. This assumption, however, is inconsistent with the observed insertion frequencies from transposon mutant libraries of M. tuberculosis. We propose a modified Binomial model of essentiality that can characterize the insertion probability of individual genes in which we allow local variation in the background insertion frequency in different non-essential regions of the genome. Using the Metropolis-Hastings algorithm, samples of the posterior insertion probabilities were obtained for each gene, and the probability of each gene being essential is estimated. We compared our predictions to those of previous methods and show that, by taking into consideration local insertion frequencies, our method is capable of making more conservative predictions that better match what is experimentally known about essential and non-essential genes.
Keywords
DNA; binomial distribution; cellular biophysics; genetics; genomics; microorganisms; molecular biophysics; molecular configurations; probability; statistical analysis; M. tuberculosis; Metropolis-Hastings algorithm; bacteria; genome; high-resolution data; local transposon insertion frequency; modified Binomial model; mutant library sequence; nonessential genes; posterior insertion probability; statistical methods; transposon mutagenesis; transposon mutant libraries; Bioinformatics; Computational biology; Data models; Genomics; Government; IEEE transactions; Libraries; Sequence analysis; essentiality; hierarchical models;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2014.2326857
Filename
6822561
Link To Document