DocumentCode
394344
Title
Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM - model and training
Author
Zhou, Jim-Lai ; Seide, Frank ; Deng, Li
Author_Institution
Microsoft Res. Asia, Beijing, China
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
We propose and evaluate a new acoustic model that combines HMM and a special type of the hidden dynamic model (HDM) a target-directed hidden trajectory model - into a single integrated model named HTHMM. The new model provides a computational model of coarticulation by representing the internal dynamics of human speech based on the hidden trajectory of the vocal-tract resonances. This paper focuses on the general structure of the new model and the EM training procedure. The corresponding MAP decoding algorithm and more detailed evaluation are given in Seide et al. (2003). Speech recognition experimental results on the Aurora2 task demonstrated that the new model, although using only context-independent phoneme units (no context-dependent parameters), is still slightly superior in word error rate to the corresponding crossword triphone HMM. This provides the evidence that the coarticulatory mechanism represented by the HTHMM via the model structure matches the traditional context-dependent modeling approach based on enumeration of model parameters.
Keywords
error statistics; hidden Markov models; speech processing; speech recognition; Aurora2 task; EM training procedure; HDM; HMM; HTHMM; acoustic model; coarticulation modeling; context-independent phoneme units; embedding; hidden dynamic model; human speech internal dynamics; integrated model; speech recognition; target-directed hidden trajectory model; training; vocal-tract resonances; word error rate; Asia; Computational modeling; Context modeling; Decoding; Error analysis; Hidden Markov models; Humans; Speech recognition; Stochastic resonance; Trajectory;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198888
Filename
1198888
Link To Document