• DocumentCode
    951880
  • Title

    Statistical Characterization of Protein Ensembles

  • Author

    Rother, Diego ; Sapiro, Guillermo ; Pande, Vijay

  • Author_Institution
    Univ. of Minnesota, Minneapolis
  • Volume
    5
  • Issue
    1
  • fYear
    2008
  • Firstpage
    42
  • Lastpage
    55
  • Abstract
    When accounting for structural fluctuations or measurement errors, a single rigid structure may not be sufficient to represent a protein. One approach to solve this problem is to represent the possible conformations as a discrete set of observed conformations, an ensemble. In this work, we follow a different richer approach and introduce a framework for estimating probability density functions in very high dimensions and then apply it to represent ensembles of folded proteins. This proposed approach combines techniques such as kernel density estimation, maximum likelihood, cross validation, and bootstrapping. We present the underlying theoretical and computational framework and apply it to artificial data and protein ensembles obtained from molecular dynamics simulations. We compare the results with those obtained experimentally, illustrating the potential and advantages of this representation.
  • Keywords
    density functional theory; molecular biophysics; molecular dynamics method; proteins; bootstrapping; cross validation; folded protein; kernel density estimation; maximum likelihood; measurement errors; molecular dynamics simulations; probability density functions; single rigid structure; structural fluctuations; Bayesian networks; bootstrapping; cross-validation; density estimation; graphical models; maximum likelihood; protein ensembles; Algorithms; Amino Acid Motifs; Computer Simulation; Likelihood Functions; Microfilament Proteins; Models, Molecular; Peptides; Probability; Protein Conformation; Proteins;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2007.1061
  • Filename
    4359856