An online speech driven talking head system

Author

Kai Zhao ; Zhiyong Wu ; Jia Jia ; Lianhong Cai

Author_Institution

Tsinghua-CUHK Joint Res. Center for Media Sci. Technol., Tsinghua Univ., Shenzhen, China

fYear

2012

fDate

18-20 Nov. 2012

Firstpage

186

Lastpage

187

Abstract

This paper presents the design and implementation of an online speech driven talking head animation system. The system first recognizes phoneme sequence from the input speech with a Chinese Mandarin speech recognizer. The phoneme sequence is further transformed to a sequence of visemes. The sequence of MPEG-4 facial animation parameters (FAPs) is further derived from the viseme sequence, and is used to drive the facial animations on a 3-dimentional talking head. The architecture and the major features are also presented in the paper, together with the evaluations of the system.

Keywords

computer animation; natural language processing; speech recognition; speech synthesis; 3-dimentional talking head; Chinese Mandarin speech recognizer; FAP; MPEG-4 facial animation parameters; input speech; online speech driven talking head animation system; phoneme sequence; phoneme sequence recognition; viseme sequence; facial animation parameters (FAPs); talking head; viseme; visual speech synthesis;

fLanguage

English

Publisher

ieee

Conference_Titel

Global High Tech Congress on Electronics (GHTCE), 2012 IEEE

Conference_Location

Shenzhen

Print_ISBN

978-1-4673-5086-0

Type

conf

DOI

10.1109/GHTCE.2012.6490153

Filename

6490153

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=1819668