مرکز منطقه ای اطلاع رساني علوم و فناوري - Stranded Gaussian mixture hidden Markov models for robust speech recognition

DocumentCode :

3162990

Title :

Stranded Gaussian mixture hidden Markov models for robust speech recognition

Author :

Zhao, Yong ; Juang, Biing-Hwang

Author_Institution :

Dept. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4301

Lastpage :

4304

Abstract :

Gaussian mixture (GMM)-HMMs, though being the predominant modeling technique for speech recognition, are often criticized as being inaccurate to model heterogeneous data sources. In this work, we propose the stranded Gaussian mixture (SGMM)-HMM, an extension of the GMM-HMM, to explicitly model the dependence among the mixture components, i.e., each mixture component is assumed to depend on the previous mixture component in addition to the state that generates it. In the evaluation over the Aurora 2 database, the proposed 20-mixture SGMM system obtains WER of 8.07%, 10% relative improvement over the baseline GMM system. The experiments demonstrate the discriminating power that would be possessed by the mixture weights in their advanced form.

Keywords :

Gaussian processes; hidden Markov models; speech recognition; Aurora 2 database; SGMM-HMM model; WER; mixture components; robust speech recognition; stranded GAUSSIAN mixture hidden MARKOV models; Data models; Hidden Markov models; Probability; Speech; Speech recognition; Synchronization; Trajectory; Dynamic Bayesian network; Gaussian mixture model; hidden Markov model; robust speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288870

Filename :

6288870

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3162990