DocumentCode :
3162990
Title :
Stranded Gaussian mixture hidden Markov models for robust speech recognition
Author :
Zhao, Yong ; Juang, Biing-Hwang
Author_Institution :
Dept. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4301
Lastpage :
4304
Abstract :
Gaussian mixture (GMM)-HMMs, though being the predominant modeling technique for speech recognition, are often criticized as being inaccurate to model heterogeneous data sources. In this work, we propose the stranded Gaussian mixture (SGMM)-HMM, an extension of the GMM-HMM, to explicitly model the dependence among the mixture components, i.e., each mixture component is assumed to depend on the previous mixture component in addition to the state that generates it. In the evaluation over the Aurora 2 database, the proposed 20-mixture SGMM system obtains WER of 8.07%, 10% relative improvement over the baseline GMM system. The experiments demonstrate the discriminating power that would be possessed by the mixture weights in their advanced form.
Keywords :
Gaussian processes; hidden Markov models; speech recognition; Aurora 2 database; SGMM-HMM model; WER; mixture components; robust speech recognition; stranded GAUSSIAN mixture hidden MARKOV models; Data models; Hidden Markov models; Probability; Speech; Speech recognition; Synchronization; Trajectory; Dynamic Bayesian network; Gaussian mixture model; hidden Markov model; robust speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288870
Filename :
6288870
Link To Document :
بازگشت