Title :
The Phoneme-Level Articulator Dynamics for Pronunciation Animation
Author :
Li, Sheng ; Wang, Lan ; Qi, En
Author_Institution :
Shenzhen Inst. of Adv. Technol., Shenzhen, China
Abstract :
Speech visualization can be extended to a task of pronunciation animation for language learners. In this paper, a three dimensional English articulation database is recorded using Carstens Electro-Magnetic Articulograph (EMA AG500). An HMM-based visual synthesis method for continuous speech is implemented to recover 3D articulatory information. The synthesized articulations are then compared to the EMA recordings for objective evaluation. Using a data-driven 3D talking head, the distinctions between the confusable phonemes can be depicted through both external and internal articulatory movements. The experiments have demonstrated that the HMM-based synthesis with limited training data can achieve the minimum RMS error of less than 2mm. The synthesized articulatory movements can be used for computer assisted pronunciation training.
Keywords :
computer animation; computer based training; data visualisation; hidden Markov models; solid modelling; speech synthesis; 3D English articulation database; 3D articulatory information; Carstens Electro-Magnetic Articulograph; HMM-based visual synthesis method; articulatory movement; computer assisted pronunciation training; confusable phoneme; continuous speech synthesis; data-driven 3D talking head; hidden Markov model; language learner; phoneme-level articulator dynamics; pronunciation animation; root mean square error; speech visualization; Coils; Hidden Markov models; Magnetic heads; Speech; Three dimensional displays; Tongue; Trajectory; EMA recordings; external and internal articulators; pronunciation animation;
Conference_Titel :
Asian Language Processing (IALP), 2011 International Conference on
Conference_Location :
Penang
Print_ISBN :
978-1-4577-1733-8
DOI :
10.1109/IALP.2011.13