مرکز منطقه ای اطلاع رساني علوم و فناوري - Towards Unified Human Parsing and Pose Estimation

DocumentCode :

253714

Title :

Towards Unified Human Parsing and Pose Estimation

Author :

Jian Dong ; Qiang Chen ; Xiaohui Shen ; Jianchao Yang ; Shuicheng Yan

Author_Institution :

Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore, Singapore, Singapore

fYear :

2014

fDate :

23-28 June 2014

Firstpage :

843

Lastpage :

850

Abstract :

We study the problem of human body configuration analysis, more specifically, human parsing and human pose estimation. These two tasks, ie identifying the semantic regions and body joints respectively over the human body image, are intrinsically highly correlated. However, previous works generally solve these two problems separately or iteratively. In this work, we propose a unified framework for simultaneous human parsing and pose estimation based on semantic parts. By utilizing Parselets and Mixture of Joint-Group Templates as the representations for these semantic parts, we seamlessly formulate the human parsing and pose estimation problem jointly within a unified framework via a tailored and-or graph. A novel Grid Layout Feature is then designed to effectively capture the spatial co-occurrence/occlusion information between/within the Parselets and MJGTs. Thus the mutually complementary nature of these two tasks can be harnessed to boost the performance of each other. The resultant unified model can be solved using the structure learning framework in a principled way. Comprehensive evaluations on two benchmark datasets for both tasks demonstrate the effectiveness of the proposed framework when compared with the state-of-the-art methods.

Keywords :

graph theory; image representation; learning (artificial intelligence); pose estimation; MJGT; body joint identification; grid layout feature; human body configuration analysis; human body image; human pose estimation problem; mixture of joint-group templates; parselets; semantic part representation; semantic region identification; spatial co-occurrence-occlusion information; structure learning framework; tailored and-or graph; unified human parsing; Deformable models; Estimation; Geometry; Joints; Labeling; Layout; Semantics; Human Parsing; Human Pose Estimation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on

Conference_Location :

Columbus, OH

Type :

conf

DOI :

10.1109/CVPR.2014.113

Filename :

6909508

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=253714