DocumentCode :
3165393
Title :
Implicit trajectory modelling using temporally varying weight regression for automatic speech recognition
Author :
Liu, Shilin ; Sim, Khe Chai
Author_Institution :
Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4761
Lastpage :
4764
Abstract :
Recently, implicit trajectory modelling using temporally varying model parameters has achieved promising gains over the discriminatively trained standard HMM system. However, these works only focus on the temporally varying means or precisions explicitly. It is interesting to explore the capability of temporally varying weights, since the effect of time varying Gaussian parameters can be achieved by adjusting the weights of Gaussian Mixture Models (GMM) for different observation. This paper proposes a Temporally Varying Weight Regression (TVWR) model to learn the importance of different Gaussian components under different temporal contexts. Technically, TVWR factorizes the HMM state likelihood such that the contextual information can be modelled using time varying weights. Additionally, approximate constraints are derived to ensure a valid probabilistic model for TVWR. Experimental results for continuous speech recognition on Wall Street Journal show consistent improvements with varying system complexity and about 12% relative significant improvements in the best case.
Keywords :
Gaussian processes; hidden Markov models; probability; regression analysis; speech recognition; GMM; Gaussian mixture models; HMM state likelihood; TVWR model; automatic speech recognition; continuous speech recognition; hidden Markov model system; implicit trajectory modelling; probabilistic model; temporally varying weight regression model; time varying Gaussian parameter effect; trained standard HMM system; Context; Context modeling; Hidden Markov models; Speech recognition; Standards; Training; Trajectory; complexity control; nonlinear constrained optimization; regression; trajectory modelling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288983
Filename :
6288983
Link To Document :
بازگشت