DocumentCode :
253714
Title :
Towards Unified Human Parsing and Pose Estimation
Author :
Jian Dong ; Qiang Chen ; Xiaohui Shen ; Jianchao Yang ; Shuicheng Yan
Author_Institution :
Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore, Singapore, Singapore
fYear :
2014
fDate :
23-28 June 2014
Firstpage :
843
Lastpage :
850
Abstract :
We study the problem of human body configuration analysis, more specifically, human parsing and human pose estimation. These two tasks, ie identifying the semantic regions and body joints respectively over the human body image, are intrinsically highly correlated. However, previous works generally solve these two problems separately or iteratively. In this work, we propose a unified framework for simultaneous human parsing and pose estimation based on semantic parts. By utilizing Parselets and Mixture of Joint-Group Templates as the representations for these semantic parts, we seamlessly formulate the human parsing and pose estimation problem jointly within a unified framework via a tailored and-or graph. A novel Grid Layout Feature is then designed to effectively capture the spatial co-occurrence/occlusion information between/within the Parselets and MJGTs. Thus the mutually complementary nature of these two tasks can be harnessed to boost the performance of each other. The resultant unified model can be solved using the structure learning framework in a principled way. Comprehensive evaluations on two benchmark datasets for both tasks demonstrate the effectiveness of the proposed framework when compared with the state-of-the-art methods.
Keywords :
graph theory; image representation; learning (artificial intelligence); pose estimation; MJGT; body joint identification; grid layout feature; human body configuration analysis; human body image; human pose estimation problem; mixture of joint-group templates; parselets; semantic part representation; semantic region identification; spatial co-occurrence-occlusion information; structure learning framework; tailored and-or graph; unified human parsing; Deformable models; Estimation; Geometry; Joints; Labeling; Layout; Semantics; Human Parsing; Human Pose Estimation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location :
Columbus, OH
Type :
conf
DOI :
10.1109/CVPR.2014.113
Filename :
6909508
Link To Document :
بازگشت