Title :
3-D motion estimation of human head for model-based image coding
Author :
Fukuhara, T. ; Murakami, T.
Author_Institution :
Commun. Syst. Lab., Kanagawa, Japan
Abstract :
Model-based image coding applied to interpersonal communication achieves very low bit-rate image transmission. To accomplish this, accurate three-dimensional (3-D) motion estimation of a speaker is necessary. A new method of 3-D motion estimation is presented, consisting of two steps. In the first step, facial contours and feature points of a speaker are extracted using filtering and snake algorithms. Five feature points on the speaker's facial image are tracked between consecutive picture frames, which gives 2-D motion vectors of the feature points. In the second step, the 3-D motion of the speaker's head is estimated using a three-layered neural network model, which is trained beforehand on many possible motion patterns of the human head generated with an existing 3-D general shape model. Experimental results show that the method not only achieves good results but is also more robust than existing methods.
Keywords :
feature extraction; filtering and prediction theory; image coding; motion estimation; neural nets; videotelephony; visual communication; 3D motion estimation; facial contours; feature points; filtering; human head; model-based image coding; three-layered neural network model; very low bit-rate image transmission; videophone communication
Journal_Title :
Communications, Speech and Vision, IEE Proceedings I