• DocumentCode
    2919949
  • Title

    Supervised local subspace learning for continuous head pose estimation

  • Author

    Huang, Dong ; Storer, Markus ; De La Torre, Fernando ; Bischof, Horst

  • fYear
    2011
  • fDate
    20-25 June 2011
  • Firstpage
    2921
  • Lastpage
    2928
  • Abstract
    Head pose estimation from images has recently attracted much attention in computer vision due to its diverse applications in face recognition, driver monitoring and human computer interaction. Most successful approaches to head pose estimation formulate the problem as a nonlinear regression between image features and continuous 3D angles (i.e. yaw, pitch and roll). However, regression-like methods suffer from three main drawbacks: (1) They typically lack generalization and overfit when trained using a few samples. (2) They fail to get reliable estimates over some regions of the output space (angles) when the training set is not uniformly sampled. For instance, if the training data contains under-sampled areas for some angles. (3) They are not robust to image noise or occlusion. To address these problems, this paper presents Supervised Local Subspace Learning (SL2), a method that learns a local linear model from a sparse and non-uniformly sampled training set. SL2 learns a mixture of local tangent spaces that is robust to under-sampled regions, and due to its regularization properties it is also robust to over-fitting. Moreover, because SL2 is a generative model, it can deal with image noise. Experimental results on the CMU Multi-PIE and BU-3DFE database show the effectiveness of our approach in terms of accuracy and computational complexity.
  • Keywords
    computational complexity; computer vision; face recognition; learning (artificial intelligence); pose estimation; regression analysis; BU-3DFE database; CMU multi-PIE database; computational complexity; computer vision; continuous 3D angles; continuous head pose estimation; driver monitoring; face recognition; human computer interaction; image features; image noise; image occlusion; local linear model; local tangent spaces; nonlinear regression; nonuniformly sampled training set; pitch; roll; sparse sampled training set; supervised local subspace learning; yaw;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
  • Conference_Location
    Providence, RI
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4577-0394-2
  • Type

    conf

  • DOI
    10.1109/CVPR.2011.5995683
  • Filename
    5995683