• DocumentCode
    2398081
  • Title

    Context and observation driven latent variable model for human pose estimation

  • Author

    Gupta, Abhinav ; Chen, Trista ; Chen, Francine ; Kimber, Don ; Davis, Larry S.

  • Author_Institution
    Maryland Univ., College Park, MD
  • fYear
    2008
  • fDate
    23-28 June 2008
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Current approaches to pose estimation and tracking can be classified into two categories: generative and discriminative. While generative approaches can accurately determine human pose from image observations, they are computationally expensive due to search in the high dimensional human pose space. On the other hand, discriminative approaches do not generalize well, but are computationally efficient. We present a hybrid model that combines the strengths of the two in an integrated learning and inference framework. We extend the Gaussian process latent variable model (GPLVM) to include an embedding from observation space (the space of image features) to the latent space. GPLVM is a generative model, but the inclusion of this mapping provides a discriminative component, making the model observation driven. Observation Driven GPLVM (OD-GPLVM) not only provides a faster inference approach, but also more accurate estimates (compared to GPLVM) in cases where dynamics are not sufficient for the initialization of search in the latent space. We also extend OD-GPLVM to learn and estimate poses from parameterized actions/gestures. Parameterized gestures are actions which exhibit large systematic variation in joint angle space for different instances due to difference in contextual variables. For example, the joint angles in a forehand tennis shot are function of the height of the ball (Figure 2). We learn these systematic variations as a function of the contextual variables. We then present an approach to use information from scene/objects to provide context for human pose estimation for such parameterized actions.
  • Keywords
    Gaussian processes; gesture recognition; image processing; pose estimation; Gaussian process latent variable model; human pose estimation; image observations; integrated learning; parameterized gestures; pose tracking; Context modeling; Educational institutions; Gaussian processes; Humans; Inverse problems; Layout; Parameter estimation; Parametric statistics; Probability distribution; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
  • Conference_Location
    Anchorage, AK
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-2242-5
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2008.4587511
  • Filename
    4587511