Automatic audio driven animation of non-verbal actions

Author

Cosker, D. ; Holt, C. ; Whatling, G. ; Rosin, P.L.

Author_Institution

Media Technol. Res. Centre, Univ. of Bath, Bath

fYear

2007

fDate

27-28 Nov. 2007

Firstpage

1

Lastpage

1

Abstract

While speech driven animation for lip-synching and facial expression synthesis from speech has previously received much attention, there is no previous work on generating non-verbal actions such as laughing and crying automatically from an audio signal. In this article initial results on a system designed to address this issue are presented. 3D facial data is recorded for a participant making different actions-i.e. laughing, crying, yawning and sneezing-using a Qualysis (Sweden) optical motion-capture system while simultaneously recording audio data. 30 retro-reflective markers were placed on the participant´s face to capture movement. Using this data, an analysis and synthesis machine was then trained consisting of a dual-input Hidden Markov Model (HMM) and a trellis search algorithm which converts HMM visual states and new input audio into new 3D motion-capture data.

Keywords

computer animation; emotion recognition; face recognition; hidden Markov models; image motion analysis; search problems; solid modelling; speech synthesis; 3D facial model; Qualysis optical motion-capture system; automatic audio driven animation; facial expression synthesis; hidden Markov model; lip-synching; nonverbal action; retro-reflective marker; speech driven animation; trellis search algorithm; Animation; HMM; Motion-Capture; Non-Verbal;

fLanguage

English

Publisher

iet

Conference_Titel

Visual Media Production, 2007. IETCVMP. 4th European Conference on

Conference_Location

London

Print_ISBN

978-0-86341-843-3

Type

conf

Filename

4454260