Title :
Implicit trajectory modelling using temporally varying weight regression for automatic speech recognition
Author :
Liu, Shilin ; Sim, Khe Chai
Author_Institution :
Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
Abstract :
Recently, implicit trajectory modelling using temporally varying model parameters has achieved promising gains over the discriminatively trained standard HMM system. However, these works only focus on the temporally varying means or precisions explicitly. It is interesting to explore the capability of temporally varying weights, since the effect of time varying Gaussian parameters can be achieved by adjusting the weights of Gaussian Mixture Models (GMM) for different observation. This paper proposes a Temporally Varying Weight Regression (TVWR) model to learn the importance of different Gaussian components under different temporal contexts. Technically, TVWR factorizes the HMM state likelihood such that the contextual information can be modelled using time varying weights. Additionally, approximate constraints are derived to ensure a valid probabilistic model for TVWR. Experimental results for continuous speech recognition on Wall Street Journal show consistent improvements with varying system complexity and about 12% relative significant improvements in the best case.
Keywords :
Gaussian processes; hidden Markov models; probability; regression analysis; speech recognition; GMM; Gaussian mixture models; HMM state likelihood; TVWR model; automatic speech recognition; continuous speech recognition; hidden Markov model system; implicit trajectory modelling; probabilistic model; temporally varying weight regression model; time varying Gaussian parameter effect; trained standard HMM system; Context; Context modeling; Hidden Markov models; Speech recognition; Standards; Training; Trajectory; complexity control; nonlinear constrained optimization; regression; trajectory modelling;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288983