مرکز منطقه ای اطلاع رساني علوم و فناوري - Near-videorealistic synthetic visual speech using non-rigid appearance models

DocumentCode :

394751

Title :

Near-videorealistic synthetic visual speech using non-rigid appearance models

Author :

Theobald, Barry J. ; Cawley, Gavin C. ; Matthews, Iain A. ; Bangham, J. Andrew

Author_Institution :

Sch. of Inf. Syst., East Anglia Univ., Norwich, UK

Volume :

fYear :

2003

fDate :

6-10 April 2003

Abstract :

We present work towards videorealistic synthetic visual speech using non-rigid appearance models. These models are used to track a talking face enunciating a set of training sentences. The resultant parameter trajectories are used in a concatenative synthesis scheme, where samples of original data are extracted from a corpus and concatenated to form new unseen sequences. Here we explore the effect on the synthesiser output of blending several synthesis units considered similar to the desired unit. We present preliminary subjective and objective results used to judge the realism of the system.

Keywords :

computer animation; image sequences; solid modelling; speech processing; video signal processing; computer graphics; computer vision; concatenative synthesis scheme; near-videorealistic synthetic visual speech; nonrigid appearance models; parameter trajectories; speech processing; talking face; training sentences; video sequences; visual speech synthesiser; Application software; Bandwidth; Covariance matrix; Face detection; Facial animation; Hidden Markov models; Principal component analysis; Shape; Speech synthesis; Training data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

0-7803-7663-3

Type :

conf

DOI :

10.1109/ICASSP.2003.1200092

Filename :

1200092

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=394751