• DocumentCode
    5908
  • Title

    Multiview Facial Landmark Localization in RGB-D Images via Hierarchical Regression With Binary Patterns

  • Author

    Zhanpeng Zhang ; Wei Zhang ; Jianzhuang Liu ; Xiaoou Tang

  • Author_Institution
    Shenzhen Key Lab. of CVPR, Shenzhen Inst. of Adv. Technol., Shenzhen, China
  • Volume
    24
  • Issue
    9
  • fYear
    2014
  • fDate
    Sept. 2014
  • Firstpage
    1475
  • Lastpage
    1485
  • Abstract
    In this paper, we propose a real-time system of multiview facial landmark localization in RGB-D images. The facial landmark localization problem is formulated into a regression framework, which estimates both the head pose and the landmark positions. In this framework, we propose a coarse-to-fine approach to handle the high-dimensional regression output. At first, 3-D face position and rotation are estimated from the depth observation via a random regression forest. Afterward, the 3-D pose is refined by fusing the estimation from the RGB observation. Finally, the landmarks are located from the RGB observation with gradient boosted decision trees in a pose conditional model. The benefits of the proposed localization framework are twofold: the pose estimation and landmark localization are solved with hierarchical regression, which is different from previous approaches where the pose and landmark locations are iteratively optimized, which relies heavily on the initial pose estimation; due to the different characters of the RGB and depth cues, they are used for landmark localization at different stages and incorporated in a robust manner. In the experiments, we show that the proposed approach outperforms state-of-the-art algorithms on facial landmark localization with RGB-D input.
  • Keywords
    decision trees; face recognition; pose estimation; regression analysis; 3D face position; 3D face rotation; RGB-D images; binary patterns; coarse-to-fine approach; gradient boosted decision tree; head pose estimatiom; hierarchical regression; multiview facial landmark localization; random regression forest; Decision trees; Estimation; Face; Three-dimensional displays; Training; Vegetation; Facial landmark localization; facial landmark localization; gradient boosting decision tree; random binary pattern; random forest; random forest (RF);
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2014.2308639
  • Filename
    6748892