Title :
An online speech driven talking head system
Author :
Kai Zhao ; Zhiyong Wu ; Jia Jia ; Lianhong Cai
Author_Institution :
Tsinghua-CUHK Joint Res. Center for Media Sci. Technol., Tsinghua Univ., Shenzhen, China
Abstract :
This paper presents the design and implementation of an online speech driven talking head animation system. The system first recognizes phoneme sequence from the input speech with a Chinese Mandarin speech recognizer. The phoneme sequence is further transformed to a sequence of visemes. The sequence of MPEG-4 facial animation parameters (FAPs) is further derived from the viseme sequence, and is used to drive the facial animations on a 3-dimentional talking head. The architecture and the major features are also presented in the paper, together with the evaluations of the system.
Keywords :
computer animation; natural language processing; speech recognition; speech synthesis; 3-dimentional talking head; Chinese Mandarin speech recognizer; FAP; MPEG-4 facial animation parameters; input speech; online speech driven talking head animation system; phoneme sequence; phoneme sequence recognition; viseme sequence; facial animation parameters (FAPs); talking head; viseme; visual speech synthesis;
Conference_Titel :
Global High Tech Congress on Electronics (GHTCE), 2012 IEEE
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4673-5086-0
DOI :
10.1109/GHTCE.2012.6490153