DocumentCode
1266511
Title
Affine-structure-based facial image encoding
Author
Chatterjee, Shoma ; Banerjee, Subhashis ; Biswas, K.K.
Author_Institution
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., New Delhi, India
Volume
146
Issue
4
fYear
1999
fDate
8/1/1999 12:00:00 AM
Firstpage
211
Lastpage
221
Abstract
A real-time algorithm for affine-structure-based video compression for facial images is presented. The face undergoing motion is segmented and triangulated to yield a set of control points. The set of control points generated by triangulation are tracked across a few frames using an intensity-based correlation technique. For accurate motion and structure estimation a Kalman-filter-based algorithm is used to track features on the facial image. The structure information of the control points is transmitted only during the bootstrapping stage. After that only the motion information is transmitted to the decoder. This reduces the number of motion parameters associated with control points in each frame. The local motion of the eyes and lips is captured using local 2-D affine transformations. For real time implementation a quad-tree based search technique is adopted to solve local correlation. Any remaining reconstruction error is accounted for using predictive encoding. Results on real image sequences demonstrate the applicability of the method
Keywords
Kalman filters; correlation methods; data compression; filtering theory; image motion analysis; image reconstruction; image segmentation; image sequences; quadtrees; tracking; video coding; Kalman-filter-based algorithm; affine-structure-based facial image encoding; bootstrapping; face segmentation; face triangulation; facial images; feature tracking; intensity-based correlation technique; local 2D affine transformations; local correlation; low-bit rate video; motion information; motion parameters; predictive encoding; quad-tree based search technique; real image sequences; real-time algorithm; reconstruction error; video compression;
fLanguage
English
Journal_Title
Vision, Image and Signal Processing, IEE Proceedings -
Publisher
iet
ISSN
1350-245X
Type
jour
DOI
10.1049/ip-vis:19990513
Filename
803323
Link To Document