Title :
Speech-to-video synthesis using facial animation parameters
Author :
Aleksic, Petar S. ; Katsaggelos, Aggelos K.
Author_Institution :
Dept. of Electr. & Comput. Eng., Northwestern Univ., Evanston, IL, USA
Abstract :
The presence of visual information in addition to audio can improve speech understanding in noisy environments. This additional information can be especially useful for people with impaired hearing who are able to speechread. This paper focuses on the problem of synthesizing the facial animation parameters (FAPs), which the MPEG-4 standard supports for the visual representation of speech, from a narrowband acoustic speech (telephone) signal. A correlation hidden Markov model (CHMM) system for performing visual speech synthesis is proposed. The CHMM system integrates an independently trained acoustic HMM (AHMM) system and a visual HMM (VHMM) system to realize speech-to-video synthesis. Objective experiments are performed by analyzing the synthesized FAPs and computing the time alignment errors. Time alignment errors are reduced by 40.5% compared to the conventional temporal scaling method.
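As a rough illustration of the audio-to-visual mapping idea only (not the paper's CHMM, which couples independently trained acoustic and visual HMMs through their correlation), the following Python sketch trains an acoustic Gaussian HMM, Viterbi-decodes audio frames into states, and maps each state to an average FAP vector learned from hypothetical time-aligned training data. The feature dimensions, the random toy data, and the hmmlearn dependency are all assumptions made for illustration.

```python
# Hypothetical sketch: drive FAP frames from an acoustic HMM state sequence.
# This is NOT the paper's CHMM; it only illustrates the audio-to-visual mapping idea.
import numpy as np
from hmmlearn import hmm  # assumed dependency; any Gaussian-HMM library would do

# Toy parallel training data (assumption): acoustic features (e.g., MFCCs)
# and time-aligned MPEG-4 FAP vectors for the same utterance.
rng = np.random.default_rng(0)
audio_feats = rng.normal(size=(500, 13))   # 500 frames x 13 acoustic features
fap_feats = rng.normal(size=(500, 10))     # 500 frames x 10 FAPs (e.g., outer-lip FAPs)

# 1. Train an acoustic HMM on the audio features alone.
ahmm = hmm.GaussianHMM(n_components=8, covariance_type="diag", n_iter=20)
ahmm.fit(audio_feats)

# 2. Decode the training audio into its most likely state sequence (Viterbi path).
states = ahmm.predict(audio_feats)

# 3. Build a state-to-FAP lookup by averaging the FAP frames assigned to each state,
#    falling back to the global mean for states that received no frames.
global_mean = fap_feats.mean(axis=0)
state_to_fap = np.stack([
    fap_feats[states == s].mean(axis=0) if np.any(states == s) else global_mean
    for s in range(ahmm.n_components)
])

# 4. Synthesis: decode new (e.g., narrowband telephone) audio and emit the
#    per-state mean FAP vector for each frame.
new_audio = rng.normal(size=(120, 13))
synth_faps = state_to_fap[ahmm.predict(new_audio)]
print(synth_faps.shape)  # (120, 10): one FAP vector per audio frame
```

In practice the synthesized FAP trajectories would be smoothed and evaluated, for example via the time alignment errors reported in the paper; this sketch omits both steps.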
Keywords :
audio-visual systems; error analysis; hidden Markov models; speech enhancement; speech intelligibility; speech synthesis; video signal processing; MPEG-4 standard; acoustic HMM system; correlation HMM system; facial animation parameters; hidden Markov model; narrowband acoustic speech signal; speech visual representation; speech-to-video synthesis; time alignment errors; visual HMM system; Acoustic noise; Auditory system; Facial animation; Hidden Markov models; Narrowband; Signal synthesis; Speech synthesis; Working environment noise;
Conference_Title :
Proceedings of the 2003 International Conference on Image Processing (ICIP 2003)
Print_ISBN :
0-7803-7750-8
DOI :
10.1109/ICIP.2003.1247166