DocumentCode
397773
Title
3D realistic talking face co-driven by text and speech
Author
Song, Mingli ; Chen, Chun ; Bu, Jiajun ; Liang, Ronghua
Author_Institution
Coll. of Comput. Sci., Zhejiang Univ., Hangzhou, China
Volume
3
fYear
2003
fDate
5-8 Oct. 2003
Firstpage
2175
Abstract
Creating a realistic 3D talking face has long been a challenge. Previous work drives the talking face with either text or speech alone, and the resulting animation is not very realistic or natural-looking. In the proposed approach, text and speech jointly drive the 3D talking face. The text is translated into a sequence of viseme transcriptions. The speech corresponding to the text is segmented into a phonetic sequence, from which a time vector for the viseme sequence is extracted. A muscle-based viseme vector is defined for each static viseme. Then, from the time vector and the static viseme sequence, dynamic visemes are generated through a time-related dominance function. Finally, according to the frame rate to be rendered, intermediate frames are interpolated between key frames, so that the animation looks more natural and realistic than results obtained from text-driven or speech-driven approaches alone.
Keywords
computer animation; image morphing; image sequences; speech processing; speech synthesis; 3D realistic talking face; animation result; dynamic visemes; frame rate; intermediate frames; muscle based viseme vector; phonetic sequence; speech driven talking face; static visemes sequence; text driven talking face; time vector; time-related dominance function; visemes transcription; Cities and towns; Computer science; Educational institutions; Facial animation; Muscles; Performance analysis; Speech processing; Speech synthesis; Text processing; Virtual reality;
fLanguage
English
Publisher
IEEE
Conference_Titel
Systems, Man and Cybernetics, 2003. IEEE International Conference on
ISSN
1062-922X
Print_ISBN
0-7803-7952-7
Type
conf
DOI
10.1109/ICSMC.2003.1244206
Filename
1244206