Title :
A Deformable Mixture Parsing Model with Parselets
Author :
Jian Dong ; Qiang Chen ; Wei Xia ; Zhongyang Huang ; Shuicheng Yan
Author_Institution :
Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore, Singapore, Singapore
Abstract :
In this work, we address the problem of human parsing, namely partitioning the human body into semantic regions, by using the novel Parselet representation. Previous works often consider solving the problem of human pose estimation as the prerequisite of human parsing. We argue that these approaches cannot obtain optimal pixel level parsing due to the inconsistent targets between these tasks. In this paper, we propose to use Parselets as the building blocks of our parsing model. Parselets are a group of parsable segments which can generally be obtained by low-level over-segmentation algorithms and bear strong semantic meaning. We then build a Deformable Mixture Parsing Model (DMPM) for human parsing to simultaneously handle the deformation and multi-modalities of Parselets. The proposed model has two unique characteristics: (1) the possible numerous modalities of Parse let ensembles are exhibited as the ``And-Or" structure of sub-trees, (2) to further solve the practical problem of Parselet occlusion or absence, we directly model the visibility property at some leaf nodes. The DMPM thus directly solves the problem of human parsing by searching for the best graph configuration from a pool of Parse let hypotheses without intermediate tasks. Comprehensive evaluations demonstrate the encouraging performance of the proposed approach.
Keywords :
graph theory; image representation; image segmentation; pose estimation; trees (mathematics); DMPM; and-or subtree structure; deformable mixture parsing model; graph configuration searching; human body partitioning; human parsing; human pose estimation; leaf nodes; low-level over-segmentation algorithm; optimal pixel level parsing; parselet absence; parselet ensembles; parselet hypothesis; parselet multimodality; parselet occlusion; parselet representation; semantic regions; Deformable models; Estimation; Feature extraction; Hair; Image segmentation; Labeling; Semantics;
Conference_Titel :
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/ICCV.2013.423