• DocumentCode
    253910
  • Title

    Iterated Second-Order Label Sensitive Pooling for 3D Human Pose Estimation

  • Author

    Ionescu, Clara ; Carreira, J. ; Sminchisescu, Cristian

  • Author_Institution
    Inst. of Math., Bucharest, Romania
  • fYear
    2014
  • fDate
    23-28 June 2014
  • Firstpage
    1661
  • Lastpage
    1668
  • Abstract
    Recently, the emergence of Kinect systems has demonstrated the benefits of predicting an intermediate body part labeling for 3D human pose estimation, in conjunction with RGB-D imagery. The availability of depth information plays a critical role, so an important question is whether a similar representation can be developed with sufficient robustness in order to estimate 3D pose from RGB images. This paper provides evidence for a positive answer, by leveraging (a) 2D human body part labeling in images, (b) second-order label-sensitive pooling over dynamically computed regions resulting from a hierarchical decomposition of the body, and (c) iterative structured-output modeling to contextualize the process based on 3D pose estimates. For robustness and generalization, we take advantage of a recent large-scale 3D human motion capture dataset, Human3.6M[18] that also has human body part labeling annotations available with images. We provide extensive experimental studies where alternative intermediate representations are compared and report a substantial 33% error reduction over competitive discriminative baselines that regress 3D human pose against global HOG features.
  • Keywords
    image capture; image colour analysis; image representation; iterative methods; pose estimation; 2D human body part labeling annotation; 3D human pose estimation; 3D human pose regression; Human3.6M; Kinect systems; RGB images; RGB-D imagery; depth information availability; global HOG features; hierarchical decomposition; intermediate body part labeling; iterated second-order label sensitive pooling; iterative structured-output modeling; large-scale 3D human motion capture dataset; Computational modeling; Context; Estimation; Feature extraction; Labeling; Three-dimensional displays; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
  • Conference_Location
    Columbus, OH
  • Type

    conf

  • DOI
    10.1109/CVPR.2014.215
  • Filename
    6909611