DocumentCode :
2035761
Title :
Dynamic Audio-Visual Mapping using Fused Hidden Markov Model Inversion Method
Author :
Xin, Le ; Tao, Jianhua ; Tan, Tieniu
Author_Institution :
Inst. of Autom., Beijing
Volume :
3
fYear :
2007
fDate :
Sept. 16 2007-Oct. 19 2007
Abstract :
Realistic audio-visual mapping remains a very challenging problem. Having short time delay between inputs and outputs is also of great importance. In this paper, we present a new dynamic audio-visual mapping approach based on the Fused Hidden Markov Model Inversion method. In our work, the Fused HMM is used to model the loose synchronization nature of the two tightly coupled audio speech and visual speech streams explicitly. Given novel audio inputs, the inversion algorithm is derived to synthesize visual counterparts by maximizing the joint probabilistic distribution of the Fused HMM. When it is implemented in the subsets built from the training corpus, realistic synthesized facial animation having relative short time delay is obtained. Experiments on a 3D motion capture bimodal database show that the synthetic results are comparable with the ground truth.
Keywords :
audio coding; audio databases; computer animation; face recognition; hidden Markov models; inverse problems; probability; speech processing; video coding; video databases; 3D motion capture bimodal database; audio speech stream; dynamic audio-visual mapping; fused hidden Markov model inversion method; joint probabilistic distribution; realistic synthesized facial animation; time delay; visual speech stream; Automation; Delay effects; Facial animation; Hidden Markov models; Laboratories; Pattern recognition; Speech processing; Speech synthesis; Streaming media; Visual databases; 3D motion capture; Baum-Welch inversion; audio-visual mapping; speech driven facial animation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing, 2007. ICIP 2007. IEEE International Conference on
Conference_Location :
San Antonio, TX
ISSN :
1522-4880
Print_ISBN :
978-1-4244-1437-6
Electronic_ISBN :
1522-4880
Type :
conf
DOI :
10.1109/ICIP.2007.4379304
Filename :
4379304
Link To Document :
بازگشت