• DocumentCode
    22965
  • Title

    Capturing Uncertainty by Modeling Local Transposon Insertion Frequencies Improves Discrimination of Essential Genes

  • Author

    DeJesus, Michael A. ; Ioerger, Thomas R.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Texas A&M Univ., College Station, TX, USA
  • Volume
    12
  • Issue
    1
  • fYear
    2015
  • fDate
    Jan.-Feb. 1 2015
  • Firstpage
    92
  • Lastpage
    102
  • Abstract
    Transposon mutagenesis experiments enable the identification of essential genes in bacteria. Deep-sequencing of mutant libraries provides a large amount of high-resolution data on essentiality. Statistical methods developed to analyze this data have traditionally assumed that the probability of observing a transposon insertion is the same across the genome. This assumption, however, is inconsistent with the observed insertion frequencies from transposon mutant libraries of M. tuberculosis. We propose a modified Binomial model of essentiality that can characterize the insertion probability of individual genes in which we allow local variation in the background insertion frequency in different non-essential regions of the genome. Using the Metropolis-Hastings algorithm, samples of the posterior insertion probabilities were obtained for each gene, and the probability of each gene being essential is estimated. We compared our predictions to those of previous methods and show that, by taking into consideration local insertion frequencies, our method is capable of making more conservative predictions that better match what is experimentally known about essential and non-essential genes.
  • Keywords
    DNA; binomial distribution; cellular biophysics; genetics; genomics; microorganisms; molecular biophysics; molecular configurations; probability; statistical analysis; M. tuberculosis; Metropolis-Hastings algorithm; bacteria; genome; high-resolution data; local transposon insertion frequency; modified Binomial model; mutant library sequence; nonessential genes; posterior insertion probability; statistical methods; transposon mutagenesis; transposon mutant libraries; Bioinformatics; Computational biology; Data models; Genomics; Government; IEEE transactions; Libraries; Sequence analysis; essentiality; hierarchical models;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2014.2326857
  • Filename
    6822561