DocumentCode
5908
Title
Multiview Facial Landmark Localization in RGB-D Images via Hierarchical Regression With Binary Patterns
Author
Zhanpeng Zhang ; Wei Zhang ; Jianzhuang Liu ; Xiaoou Tang
Author_Institution
Shenzhen Key Lab. of CVPR, Shenzhen Inst. of Adv. Technol., Shenzhen, China
Volume
24
Issue
9
fYear
2014
fDate
Sept. 2014
Firstpage
1475
Lastpage
1485
Abstract
In this paper, we propose a real-time system of multiview facial landmark localization in RGB-D images. The facial landmark localization problem is formulated into a regression framework, which estimates both the head pose and the landmark positions. In this framework, we propose a coarse-to-fine approach to handle the high-dimensional regression output. At first, 3-D face position and rotation are estimated from the depth observation via a random regression forest. Afterward, the 3-D pose is refined by fusing the estimation from the RGB observation. Finally, the landmarks are located from the RGB observation with gradient boosted decision trees in a pose conditional model. The benefits of the proposed localization framework are twofold: the pose estimation and landmark localization are solved with hierarchical regression, which is different from previous approaches where the pose and landmark locations are iteratively optimized, which relies heavily on the initial pose estimation; due to the different characters of the RGB and depth cues, they are used for landmark localization at different stages and incorporated in a robust manner. In the experiments, we show that the proposed approach outperforms state-of-the-art algorithms on facial landmark localization with RGB-D input.
Keywords
decision trees; face recognition; pose estimation; regression analysis; 3D face position; 3D face rotation; RGB-D images; binary patterns; coarse-to-fine approach; gradient boosted decision tree; head pose estimatiom; hierarchical regression; multiview facial landmark localization; random regression forest; Decision trees; Estimation; Face; Three-dimensional displays; Training; Vegetation; Facial landmark localization; facial landmark localization; gradient boosting decision tree; random binary pattern; random forest; random forest (RF);
fLanguage
English
Journal_Title
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher
ieee
ISSN
1051-8215
Type
jour
DOI
10.1109/TCSVT.2014.2308639
Filename
6748892
Link To Document