DocumentCode :
454538
Title :
Augmented Statistical Models for Speech Recognition
Author :
Layton, M.I. ; Gales, M.J.F.
Author_Institution :
Eng. Dept., Cambridge Univ.
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
Recently there has been significant interest in developing new acoustic models for speech recognition. One such model, that allows complex dependencies to be represented, is the augmented statistical model. This incorporates additional dependencies by constructing a local exponential expansion of a standard HMM. Unfortunately, the resulting model often has an intractable normalisation term, rendering training difficult for all but binary classification tasks. In this paper, conditional augmented (C-Aug) models are proposed as an attractive alternative. Instead of modelling utterance likelihoods and inferring decision boundaries, C-Aug models directly model the posterior probability of class labels, conditioned on the utterance. The resulting model is easy to normalise and can be trained using conditional maximum likelihood estimation. In addition, as a convex model, the optimisation converges to a global maximum
Keywords :
maximum likelihood estimation; probability; speech recognition; augmented statistical models; conditional augmented models; conditional maximum likelihood estimation; posterior probability; speech recognition; Acoustical engineering; Hidden Markov models; Maximum likelihood estimation; Mutual information; Speech recognition; Statistical distributions; Statistics; Taylor series; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1659974
Filename :
1659974
Link To Document :
بازگشت