Title :
Training a talking head
Author :
Cohen, Michael M. ; Massaro, Dominic W. ; Clark, Rashid
Author_Institution :
Perceptual Sci. Lab., California Univ., Santa Cruz, CA, USA
Abstract :
A Cyberware laser scan of DWM was made, Baldi's generic morphology was mapped into the form of DWM, this head was trained on real data recorded with Optotrak LED markers, and the quality of its speech was evaluated. Participants were asked to recognize auditory sentences presented alone in noise, aligned with the newly trained synthetic textured mapped target face, or aligned with the original natural face. There was a significant advantage when the noisy auditory sentence was paired with either head, with the synthetic textured mapped target face giving as much of an improvement as the original recordings of the natural face.
Keywords :
computer animation; image texture; speech synthesis; speech-based user interfaces; Cyberware laser scan; DWM; Optotrak LED markers; auditory sentence recognition; face animation; generic morphology; noise; speech intelligibility; speech quality; synthetic textured mapped target face; talking head training; user interface; Facial animation; Humans; Laser modes; Light emitting diodes; Magnetic heads; Morphology; Shape; Speech analysis; Speech processing
Conference_Title :
Multimodal Interfaces, 2002. Proceedings. Fourth IEEE International Conference on
Print_ISBN :
0-7695-1834-6
DOI :
10.1109/ICMI.2002.1167046