Title :
3D Visible Speech Animation Driven by Chinese Prosody Markup Language
Author :
Zhang, Siguang ; Wang, Lichun ; Tang, Hengliang
Author_Institution :
Coll. of Comput. Sci., Beijing Univ. of Technol., Beijing
Abstract :
This paper proposes a new approach for generating smart 3D speech animation. The basic idea is to synthesize animated faces from prosodic information that the user edits with a markup language. The proposed technique combines the advantages of the performance-driven and parameter-driven approaches: it greatly reduces the manual modeling workload of traditional key-frame animation, and it makes the animation generation process easy to control. To relate the prosody text to the 3D animation, the technique builds a parametric model based on an exponential formula. It takes pre-obtained 3D dynamic visemes and prosodic tags recorded in CPML (Chinese Prosody Markup Language) as input and outputs a segment of vivid speech animation. Experimental results show that (1) the proposed technique synthesizes animation with different effects depending on the availability of prosodic information, and (2) it produces realistic results using less data than conventional methods.
Keywords :
computer animation; face recognition; natural language processing; speech synthesis; text analysis; 3D dynamic viseme synthesis; Chinese prosody markup language; exponential formula; face animation synthesis; parameter-driven approach; performance-driven approach; prosody text; smart 3D visible speech animation; Databases; Facial animation; Magnetic heads; Markup languages; Motion estimation; Mouth; Muscles; Parameter estimation; Shape; Speech synthesis; 3D visible speech animation; dynamic viseme; prosodic model; prosody text markup language;
Conference_Title :
Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on
Conference_Location :
Dalian, Liaoning
Print_ISBN :
978-0-7695-3273-8
DOI :
10.1109/ALPIT.2008.56