DocumentCode
3527192
Title
A fast, accurate approximation to log likelihood of Gaussian mixture models
Author
Dognin, Pierre L. ; Goel, Vaibhava ; Hershey, John R. ; Olsen, Peder A.
Author_Institution
IBM T.J. Watson Res. Center, Yorktown Heights, NY
fYear
2009
fDate
19-24 April 2009
Firstpage
3817
Lastpage
3820
Abstract
It has been a common practice in speech recognition and elsewhere to approximate the log likelihood of a Gaussian mixture model (GMM) with the maximum component log likelihood. While often a computational necessity, the max approximation comes at a price of inferior modeling when the Gaussian components significantly overlap. This paper shows how the approximation error can be reduced by changing component priors. In our experiments the loss in word error rate due to max approximation, albeit small, is reduced by 50-100% at no cost in computational efficiency. Furthermore, we expect acoustic models will become larger with time and increase component overlap and word error rate loss. This makes reducing the approximation error more relevant. The techniques considered do not use the original data and can easily be applied as a post-processing step to any GMM.
Keywords
Gaussian processes; approximation theory; maximum likelihood estimation; speech recognition; Gaussian mixture model; maximum component log likelihood approximation; speech recognition; Approximation error; Bismuth; Computational efficiency; Context modeling; Error analysis; Exponential distribution; Hidden Markov models; Speech recognition; Upper bound; Gaussian mixture model; acoustic model; exponential distribution; maximum approximation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960459
Filename
4960459
Link To Document