DocumentCode
253910
Title
Iterated Second-Order Label Sensitive Pooling for 3D Human Pose Estimation
Author
Ionescu, Clara ; Carreira, J. ; Sminchisescu, Cristian
Author_Institution
Inst. of Math., Bucharest, Romania
fYear
2014
fDate
23-28 June 2014
Firstpage
1661
Lastpage
1668
Abstract
Recently, the emergence of Kinect systems has demonstrated the benefits of predicting an intermediate body part labeling for 3D human pose estimation, in conjunction with RGB-D imagery. The availability of depth information plays a critical role, so an important question is whether a similar representation can be developed with sufficient robustness in order to estimate 3D pose from RGB images. This paper provides evidence for a positive answer, by leveraging (a) 2D human body part labeling in images, (b) second-order label-sensitive pooling over dynamically computed regions resulting from a hierarchical decomposition of the body, and (c) iterative structured-output modeling to contextualize the process based on 3D pose estimates. For robustness and generalization, we take advantage of a recent large-scale 3D human motion capture dataset, Human3.6M[18] that also has human body part labeling annotations available with images. We provide extensive experimental studies where alternative intermediate representations are compared and report a substantial 33% error reduction over competitive discriminative baselines that regress 3D human pose against global HOG features.
Keywords
image capture; image colour analysis; image representation; iterative methods; pose estimation; 2D human body part labeling annotation; 3D human pose estimation; 3D human pose regression; Human3.6M; Kinect systems; RGB images; RGB-D imagery; depth information availability; global HOG features; hierarchical decomposition; intermediate body part labeling; iterated second-order label sensitive pooling; iterative structured-output modeling; large-scale 3D human motion capture dataset; Computational modeling; Context; Estimation; Feature extraction; Labeling; Three-dimensional displays; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location
Columbus, OH
Type
conf
DOI
10.1109/CVPR.2014.215
Filename
6909611
Link To Document