DocumentCode :
1466702
Title :
3-D Head Tracking via Invariant Keypoint Learning
Author :
Wang, Haibo ; Davoine, Franck ; Lepetit, Vincent ; Chaillou, Christophe ; Pan, Chunhong
Author_Institution :
Shandong Univ., Jinan, China
Volume :
22
Issue :
8
fYear :
2012
Firstpage :
1113
Lastpage :
1126
Abstract :
Keypoint matching is a standard tool to solve the correspondence problem in vision applications. However, in 3-D face tracking, this approach is often deficient because the human face complexities, together with its rich viewpoint, nonrigid expression, and lighting variations in typical applications, can cause many variations impossible to handle by existing keypoint detectors and descriptors. In this paper, we propose a new approach to tailor keypoint matching to track the 3-D pose of the user head in a video stream. The core idea is to learn keypoints that are explicitly invariant to these challenging transformations. First, we select keypoints that are stable under randomly drawn small viewpoints, nonrigid deformations, and illumination changes. Then, we treat keypoint descriptor learning at different large angles as an incremental scheme to learn discriminative descriptors. At matching time, to reduce the ratio of outlier correspondences, we use second-order color information to prune keypoints unlikely to lie on the face. Moreover, we integrate optical flow correspondences in an adaptive way to remove motion jitter efficiently. Extensive experiments show that the proposed approach can lead to fast, robust, and accurate 3-D head tracking results even under very challenging scenarios.
Keywords :
computer vision; image colour analysis; image matching; image motion analysis; image sequences; jitter; learning (artificial intelligence); lighting; object tracking; pose estimation; video signal processing; 3D face tracking; 3D head tracking; 3D pose tracking; discriminative descriptor learning; illumination changes; incremental learning; invariant keypoint learning; keypoint descriptor learning; keypoint detector; keypoint matching; lighting variations; matching time; motion jitter removal; nonrigid deformations; nonrigid expression; optical flow correspondences; outlier correspondences ratio reduction; second-order color information; video stream; vision applications; Face; Image color analysis; Lighting; Nonlinear distortion; Optical imaging; Three dimensional displays; 3-D head tracking; keypoint-based tracking; pose estimation;
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
ieee
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/TCSVT.2012.2190474
Filename :
6166872
Link To Document :
بازگشت