DocumentCode :
2702127
Title :
Use of Differential Cepstra as Acoustic Features in Hidden Trajectory Modeling for Phonetic Recognition
Author :
Li Deng ; Dong Yu
Author_Institution :
Microsoft Res., Redmond, WA, USA
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
The earlier version of the hidden trajectory model (HTM) for speech dynamics which predicts the "static" cepstra as the observed acoustic feature is generalized to one which predicts joint static cepstra and their temporal differentials (i.e., delta cepstra). The formulation of this generalized HTM is presented in the generative-modeling framework, enabling efficient computation of the joint likelihood for both static and delta cepstral sequences as the acoustic features given the model. The parameter estimation techniques for the new model are developed and presented, giving closed-form estimation formulas after the use of vector Taylor series approximation. We show principled generalization from the earlier static-cepstra HTM to the new static/delta-cepstra HTM not only in terms of model formulations but also in terms of their respective analytical forms in (monophone) parameter estimation. Experimental results on the standard TIMIT phonetic recognition task demonstrate recognition accuracy improvement over the earlier best HTM system, both significantly better than state-of-the-art triphone HMM systems.
Keywords :
acoustic signal processing; cepstral analysis; parameter estimation; speech recognition; vectors; TIMIT phonetic recognition task; acoustic features; delta cepstral sequences; differential cepstra; hidden trajectory modeling; parameter estimation techniques; speech dynamics; static cepstral sequences; vector Taylor series approximation; Cepstral analysis; Hidden Markov models; Parameter estimation; Pattern recognition; Predictive models; Speech processing; Speech recognition; Taylor series; Trajectory; Video recording; delta cepstra; generative modeling; hidden trajectory modeling; joint static/dynamic feature; phonetic recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366945
Filename :
4218133
Link To Document :
بازگشت