Title :
Speaker-independent speech-driven facial animation using a hierarchical model
Author :
Cosker, D.P. ; Marshall, A.D. ; Rosin, P.L. ; Hicks, Y.A.
Author_Institution :
Univ. of Wales, Cardiff, UK
Abstract :
We present a system capable of producing video-realistic animation of a speaker given only an audio signal. The audio input requires no phonetic labelling and is speaker independent. The system needs only a small training set of video to achieve convincing, realistic facial synthesis. It learns the natural mouth and face dynamics of a speaker, allowing new facial poses, unseen in the training video, to be synthesised. To achieve this we have developed a novel approach that utilises a hierarchical, nonlinear PCA model coupling speech and appearance. We show that the model is capable of synthesising videos of a speaker using new audio segments from both previously heard and unheard speakers. The model is highly compact, making it suitable for a wide range of real-time applications in multimedia and telecommunications using standard hardware.
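To illustrate the core idea of coupling speech and appearance in a single PCA space, below is a minimal sketch in Python. It fits a joint (linear) PCA over concatenated speech features and appearance parameters, then estimates appearance from speech at synthesis time. The frame-aligned MFCC vectors and appearance parameters are assumed inputs, and the paper's hierarchical, nonlinear extension is not reproduced here.

```python
# Minimal sketch of a coupled speech-appearance PCA model (linear only);
# the paper's hierarchical, nonlinear variant is NOT reproduced here.
# Assumes frame-aligned speech features (e.g. MFCCs) and appearance
# parameters extracted from the training video -- both hypothetical inputs.
import numpy as np


class CoupledPCA:
    def __init__(self, n_components):
        self.n_components = n_components

    def fit(self, speech, appearance):
        # speech: (frames, d_s), appearance: (frames, d_a)
        self.d_s = speech.shape[1]
        joint = np.hstack([speech, appearance])      # couple the two modalities
        self.mean = joint.mean(axis=0)
        centred = joint - self.mean
        # Principal components of the joint speech-appearance space
        _, _, vt = np.linalg.svd(centred, full_matrices=False)
        self.components = vt[: self.n_components]    # (k, d_s + d_a)
        return self

    def appearance_from_speech(self, speech_frame):
        # Estimate joint-model coefficients from the speech block alone,
        # then reconstruct the appearance block from those coefficients.
        bs = self.components[:, : self.d_s]          # speech part of the basis
        ba = self.components[:, self.d_s:]           # appearance part of the basis
        centred_s = speech_frame - self.mean[: self.d_s]
        coeffs, *_ = np.linalg.lstsq(bs.T, centred_s, rcond=None)
        return self.mean[self.d_s:] + ba.T @ coeffs


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    speech = rng.normal(size=(200, 13))       # e.g. 13 MFCCs per frame
    appearance = rng.normal(size=(200, 20))   # e.g. 20 appearance parameters
    model = CoupledPCA(n_components=8).fit(speech, appearance)
    predicted = model.appearance_from_speech(speech[0])
    print(predicted.shape)                    # (20,)
```

Because the synthesis step only projects speech features onto a compact learned basis, a model of this kind stays small and fast, which is consistent with the real-time, standard-hardware claim in the abstract.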
Keywords :
computer animation; eigenvalues and eigenfunctions; image reconstruction; learning systems; principal component analysis; real-time systems; speech processing; appearance speech association; eigenvectors; face reconstruction; facial animation; hierarchical facial model; machine learning; nonlinear PCA model; principal component analysis; real-time systems; speech-appearance model;
Conference_Title :
Visual Information Engineering, 2003. VIE 2003. International Conference on
Print_ISBN :
0-85296-757-8
DOI :
10.1049/cp:20030514