• DocumentCode
    1120440
  • Title

    BM³E: Discriminative Density Propagation for Visual Tracking

  • Author

    Sminchisescu, Cristian ; Kanaujia, Atul ; Metaxas, Dimitris N.

  • Author_Institution
    Univ. of Chicago, Chicago
  • Volume
    29
  • Issue
    11
  • fYear
    2007
  • Firstpage
    2030
  • Lastpage
    2044
  • Abstract
    We introduce BM³E, a conditional Bayesian mixture of experts Markov model that achieves consistent probabilistic estimates for discriminative visual tracking. The model applies to problems of temporal and uncertain inference and represents the unexplored bottom-up counterpart of pervasive generative models estimated with Kalman filtering or particle filtering. Instead of inverting a nonlinear generative observation model at runtime, we learn to cooperatively predict complex state distributions directly from descriptors that encode image observations (typically, bag-of-feature global image histograms or descriptors computed over regular spatial grids). These are integrated in a conditional graphical model in order to enforce temporal smoothness constraints and allow a principled management of uncertainty. The algorithms combine sparsity, mixture modeling, and nonlinear dimensionality reduction for efficient computation in high-dimensional continuous state spaces. The combined system automatically self-initializes and recovers from failure. The research has three contributions: (1) we establish the density propagation rules for discriminative inference in continuous, temporal chain models, (2) we propose flexible supervised and unsupervised algorithms to learn feed-forward, multivalued contextual mappings (multimodal state distributions) based on compact, conditional Bayesian mixture of experts models, and (3) we validate the framework empirically for the reconstruction of 3D human motion in monocular video sequences. Our tests on both real and motion-capture-based sequences show significant performance gains with respect to competing nearest neighbor, regression, and structured prediction methods.
  • Keywords
    Bayes methods; Markov processes; image motion analysis; image reconstruction; inference mechanisms; 3D human motion reconstruction; BM3E; Kalman filtering; Markov model; complex state distributions; conditional Bayesian mixture; continuous temporal chain models; discriminative density propagation; discriminative inference; discriminative visual tracking; feedforward multivalued contextual mappings; image encoding; monocular video sequences; multimodal state distributions; particle filtering; Bayesian methods; Context modeling; Distributed computing; Filtering; Histograms; Inference algorithms; Kalman filters; Mesh generation; Predictive models; Runtime; computer vision; motion; statistical models; tracking; video analysis; Algorithms; Artificial Intelligence; Bayes Theorem; Computer Simulation; Discriminant Analysis; Image Enhancement; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Information Storage and Retrieval; Markov Chains; Models, Statistical; Motion; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Video Recording;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2007.1111
  • Filename
    4302766
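
  • Illustration
    The abstract describes predicting multimodal state distributions directly from image descriptors with a conditional mixture of experts. The following is a minimal sketch of that general idea only, not the authors' implementation: it evaluates p(x | r) = sum_i g_i(r) N(x; w_i . r, var_i) for a scalar state x, with softmax gates and linear Gaussian experts. All names (moe_conditional, gate_w, expert_w, expert_var) and all numeric values are hypothetical, and the weights are fixed by hand rather than learned with the sparse Bayesian procedure the paper proposes.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(w, r):
    """Inner product of two equal-length lists."""
    return sum(wi * ri for wi, ri in zip(w, r))

def moe_conditional(r, gate_w, expert_w, expert_var):
    """Return the (gate, mean, variance) components of p(x | r).

    Assumption: scalar state x, linear experts, input-dependent softmax gates.
    """
    gates = softmax([dot(g, r) for g in gate_w])
    means = [dot(w, r) for w in expert_w]
    return list(zip(gates, means, expert_var))

def density(x, components):
    """Evaluate the Gaussian mixture density at x."""
    return sum(
        g * math.exp(-0.5 * (x - m) ** 2 / v) / math.sqrt(2.0 * math.pi * v)
        for g, m, v in components
    )

# Hypothetical descriptor and hand-picked parameters (illustration only).
r = [1.0, 0.5]
gate_w = [[0.0, 0.0], [0.0, 0.0]]     # equal gates for any input
expert_w = [[1.0, 0.0], [-1.0, 0.0]]  # two experts predicting opposite states
expert_var = [0.1, 0.1]

components = moe_conditional(r, gate_w, expert_w, expert_var)
```

    With these values the predicted density has two well-separated modes, at x = 1 and x = -1, which is the kind of multivalued mapping a single regressor would average into an implausible estimate near x = 0.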