Title :
An intelligent facial image coding driven by speech and phoneme
Author :
Morishima, Shigeo ; Aizawa, Kiyoharu ; Harashima, Himshi
Author_Institution :
Dept. of Electr. Eng., Seikei Univ., Tokyo, Japan
Abstract :
The authors propose and compare two types of model-based facial motion coding schemes, i.e. synthesis by rules and synthesis by parameters. In synthesis by rules, facial motion images are synthesized on the basis of rules extracted by analysis of training image samples that include all of the phonemes and coarticulation. This system can be utilized as an automatic facial animation synthesizer from text input or as a man-machine interface using the facial motion image. In synthesis by parameters, facial motion images are synthesized on the basis of a code word index of speech parameters. Experimental results indicate good performance for both systems, which can create natural facial-motion images with very low transmission rate. Details of 3-D modeling, algorithm synthesis, and performance are discussed
Keywords :
computerised picture processing; encoding; speech analysis and processing; 3-D modeling; automatic facial animation synthesizer; coarticulation; code word index; facial motion coding; intelligent facial image coding; man-machine interface; phoneme; speech parameters; synthesis by parameters; synthesis by rules; training image samples; Facial animation; Image analysis; Image coding; Image motion analysis; Motion analysis; Speech coding; Speech synthesis; Synthesizers; Three dimensional displays; User interfaces;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location :
Glasgow
DOI :
10.1109/ICASSP.1989.266799