DocumentCode
1819668
Title
An online speech driven talking head system
Author
Kai Zhao ; Zhiyong Wu ; Jia Jia ; Lianhong Cai
Author_Institution
Tsinghua-CUHK Joint Res. Center for Media Sci. Technol., Tsinghua Univ., Shenzhen, China
fYear
2012
fDate
18-20 Nov. 2012
Firstpage
186
Lastpage
187
Abstract
This paper presents the design and implementation of an online speech driven talking head animation system. The system first recognizes a phoneme sequence from the input speech using a Chinese Mandarin speech recognizer, then transforms the phoneme sequence into a sequence of visemes. A sequence of MPEG-4 facial animation parameters (FAPs) is derived from the viseme sequence and used to drive the facial animation of a 3-dimensional talking head. The paper also presents the architecture and major features of the system, together with evaluations of the system.
Keywords
computer animation; natural language processing; speech recognition; speech synthesis; 3-dimensional talking head; Chinese Mandarin speech recognizer; FAP; MPEG-4 facial animation parameters; input speech; online speech driven talking head animation system; phoneme sequence; phoneme sequence recognition; viseme sequence; facial animation parameters (FAPs); talking head; viseme; visual speech synthesis
fLanguage
English
Publisher
IEEE
Conference_Titel
Global High Tech Congress on Electronics (GHTCE), 2012 IEEE
Conference_Location
Shenzhen
Print_ISBN
978-1-4673-5086-0
Type
conf
DOI
10.1109/GHTCE.2012.6490153
Filename
6490153