DocumentCode :
3484579
Title :
Discriminative splitting of Gaussian/log-linear mixture HMMs for speech recognition
Author :
Tahir, Muhammad Ali ; Schlüter, Ralf ; Ney, Hermann
Author_Institution :
Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
fYear :
2011
fDate :
11-15 Dec. 2011
Firstpage :
7
Lastpage :
11
Abstract :
This paper presents a method to incorporate mixture density splitting into the acoustic model discriminative log-linear training. The standard method is to obtain a high resolution model by maximum likelihood training and density splitting, and then further training this model discriminatively. For a single Gaussian density per state the log-linear MMI optimization is a global maximum problem, and by further splitting and discriminative training of this model we can get a higher complexity model. The mixture training is not a global maximum problem, nevertheless experimentally we achieve large gains in the objective function and corresponding moderate gains in the word error rate on a large vocabulary corpus.
Keywords :
Gaussian processes; hidden Markov models; maximum likelihood estimation; speech recognition; Gaussian-log-linear mixture HMM; acoustic model discriminative log-linear training; complexity model; density splitting; discriminative splitting; high resolution model; log-linear MMI optimization; maximum likelihood training; mixture density splitting; speech recognition; vocabulary corpus; Hidden Markov models; Mel frequency cepstral coefficient; Optimization; Speech recognition; Training; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
Type :
conf
DOI :
10.1109/ASRU.2011.6163896
Filename :
6163896
Link To Document :
بازگشت