مرکز منطقه ای اطلاع رساني علوم و فناوري - Implicit trajectory modelling using temporally varying weight regression for automatic speech recognition

DocumentCode :

3165393

Title :

Implicit trajectory modelling using temporally varying weight regression for automatic speech recognition

Author :

Liu, Shilin ; Sim, Khe Chai

Author_Institution :

Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4761

Lastpage :

4764

Abstract :

Recently, implicit trajectory modelling using temporally varying model parameters has achieved promising gains over the discriminatively trained standard HMM system. However, these works only focus on the temporally varying means or precisions explicitly. It is interesting to explore the capability of temporally varying weights, since the effect of time varying Gaussian parameters can be achieved by adjusting the weights of Gaussian Mixture Models (GMM) for different observation. This paper proposes a Temporally Varying Weight Regression (TVWR) model to learn the importance of different Gaussian components under different temporal contexts. Technically, TVWR factorizes the HMM state likelihood such that the contextual information can be modelled using time varying weights. Additionally, approximate constraints are derived to ensure a valid probabilistic model for TVWR. Experimental results for continuous speech recognition on Wall Street Journal show consistent improvements with varying system complexity and about 12% relative significant improvements in the best case.

Keywords :

Gaussian processes; hidden Markov models; probability; regression analysis; speech recognition; GMM; Gaussian mixture models; HMM state likelihood; TVWR model; automatic speech recognition; continuous speech recognition; hidden Markov model system; implicit trajectory modelling; probabilistic model; temporally varying weight regression model; time varying Gaussian parameter effect; trained standard HMM system; Context; Context modeling; Hidden Markov models; Speech recognition; Standards; Training; Trajectory; complexity control; nonlinear constrained optimization; regression; trajectory modelling;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288983

Filename :

6288983

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3165393