DocumentCode
454538
Title
Augmented Statistical Models for Speech Recognition
Author
Layton, M.I. ; Gales, M.J.F.
Author_Institution
Eng. Dept., Cambridge Univ.
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
Recently there has been significant interest in developing new acoustic models for speech recognition. One such model, that allows complex dependencies to be represented, is the augmented statistical model. This incorporates additional dependencies by constructing a local exponential expansion of a standard HMM. Unfortunately, the resulting model often has an intractable normalisation term, rendering training difficult for all but binary classification tasks. In this paper, conditional augmented (C-Aug) models are proposed as an attractive alternative. Instead of modelling utterance likelihoods and inferring decision boundaries, C-Aug models directly model the posterior probability of class labels, conditioned on the utterance. The resulting model is easy to normalise and can be trained using conditional maximum likelihood estimation. In addition, as a convex model, the optimisation converges to a global maximum
Keywords
maximum likelihood estimation; probability; speech recognition; augmented statistical models; conditional augmented models; conditional maximum likelihood estimation; posterior probability; speech recognition; Acoustical engineering; Hidden Markov models; Maximum likelihood estimation; Mutual information; Speech recognition; Statistical distributions; Statistics; Taylor series; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1659974
Filename
1659974
Link To Document