مرکز منطقه ای اطلاع رساني علوم و فناوري - Use of Differential Cepstra as Acoustic Features in Hidden Trajectory Modeling for Phonetic Recognition

DocumentCode :

2702127

Title :

Use of Differential Cepstra as Acoustic Features in Hidden Trajectory Modeling for Phonetic Recognition

Author :

Li Deng ; Dong Yu

Author_Institution :

Microsoft Res., Redmond, WA, USA

Volume :

fYear :

2007

fDate :

15-20 April 2007

Abstract :

The earlier version of the hidden trajectory model (HTM) for speech dynamics which predicts the "static" cepstra as the observed acoustic feature is generalized to one which predicts joint static cepstra and their temporal differentials (i.e., delta cepstra). The formulation of this generalized HTM is presented in the generative-modeling framework, enabling efficient computation of the joint likelihood for both static and delta cepstral sequences as the acoustic features given the model. The parameter estimation techniques for the new model are developed and presented, giving closed-form estimation formulas after the use of vector Taylor series approximation. We show principled generalization from the earlier static-cepstra HTM to the new static/delta-cepstra HTM not only in terms of model formulations but also in terms of their respective analytical forms in (monophone) parameter estimation. Experimental results on the standard TIMIT phonetic recognition task demonstrate recognition accuracy improvement over the earlier best HTM system, both significantly better than state-of-the-art triphone HMM systems.

Keywords :

acoustic signal processing; cepstral analysis; parameter estimation; speech recognition; vectors; TIMIT phonetic recognition task; acoustic features; delta cepstral sequences; differential cepstra; hidden trajectory modeling; parameter estimation techniques; speech dynamics; static cepstral sequences; vector Taylor series approximation; Cepstral analysis; Hidden Markov models; Parameter estimation; Pattern recognition; Predictive models; Speech processing; Speech recognition; Taylor series; Trajectory; Video recording; delta cepstra; generative modeling; hidden trajectory modeling; joint static/dynamic feature; phonetic recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on

Conference_Location :

Honolulu, HI

ISSN :

1520-6149

Print_ISBN :

1-4244-0727-3

Type :

conf

DOI :

10.1109/ICASSP.2007.366945

Filename :

4218133

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2702127