مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

2971597

Title :

Training a talking head

Author :

Cohen, Michael M. ; Massaro, Dominic W. ; Clark, Rashid

Author_Institution :

Perceptual Sci. Lab., California Univ., Santa Cruz, CA, USA

fYear :

2002

fDate :

2002

Firstpage :

499

Lastpage :

504

Abstract :

A Cyberware laser scan of DWM was made, Baldi´s generic morphology was mapped into the form of DWM, this head was trained on real data recorded with Optotrak LED markers, and the quality of its speech was evaluated. Participants were asked to recognize auditory sentences presented alone in noise, aligned with the newly trained synthetic textured mapped target face, or the original natural face. There was a significant advantage when the noisy auditory sentence was paired with either head, with the synthetic textured mapped target face giving as much of an improvement as the original recordings of the natural face.

Keywords :

computer animation; image texture; speech synthesis; speech-based user interfaces; Cyberware laser scan; DWM; Optotrak LED markers; auditory sentence recognition; face animation; generic morphology; noise; speech intelligibility; speech quality; speech synthesis; synthetic textured mapped target face; talking head training; user interface; Facial animation; Humans; Laser modes; Light emitting diodes; Magnetic heads; Morphology; Shape; Speech analysis; Speech processing; Speech synthesis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimodal Interfaces, 2002. Proceedings. Fourth IEEE International Conference on

Print_ISBN :

0-7695-1834-6

Type :

conf

DOI :

10.1109/ICMI.2002.1167046

Filename :

1167046

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2971597