Title :
Video Realistic Mouth Animation Based on an Audio Visual DBN Model with Articulatory Features and Constrained Asynchrony
Author :
Jiang, Dongmei ; Liu, Peizhen ; Ravyse, Ilse ; Sahli, Hichem ; Verhelst, Werner
Author_Institution :
VUB-NPU Joint Res. Group on AVSP, Northwestern Polytech. Univ., Xi´´an, China
Abstract :
This paper presents a mouth animation construction method based on the DBN models with articulatory features (AF_AVDBN), in which the articulatory features of lips, tongue, glottis/velum can be asynchronous within a maximum asynchrony constraint to describe the speech production process more reasonably. Given an audio input and the trained AF_AVDBN models, the optimal visual feature learning algorithm is deduced based on the Maximum Likelihood Estimation criterion. The learned visual features are then used to construct the mouth images for the input speech. Objective and subjective evaluations on the mouth animations of 110 speech sentences show that the learned visual features from the AF_AVDBN models track the real visual features very closely, and the constructed mouth images from the AF_AVDBN models are very much like the real ones.
Keywords :
computer animation; face recognition; maximum likelihood estimation; speech synthesis; articulatory features; audio visual DBN model; maximum likelihood estimation criterion; mouth images; video realistic mouth animation; visual feature learning algorithm; Covariance matrix; Facial animation; Hidden Markov models; Image converters; MPEG 4 Standard; Maximum likelihood estimation; Mouth; Speech processing; Speech recognition; Speech synthesis; AF_AVDBN; articulatory features; asynchrony; mouth animation;
Conference_Titel :
Image and Graphics, 2009. ICIG '09. Fifth International Conference on
Conference_Location :
Xi´an, Shanxi
Print_ISBN :
978-1-4244-5237-8
DOI :
10.1109/ICIG.2009.51