DocumentCode :
959824
Title :
Mixtures of inverse covariances
Author :
Vanhoucke, Vincent ; Sankar, Ananth
Author_Institution :
Speech R&D Group, Menlo Park, CA, USA
Volume :
12
Issue :
3
fYear :
2004
fDate :
5/1/2004
Firstpage :
250
Lastpage :
264
Abstract :
We describe a model that approximates full covariances in a Gaussian mixture while significantly reducing both the number of parameters to estimate and the computations required to evaluate the Gaussian likelihoods. In this model, the inverse covariance of each Gaussian in the mixture is expressed as a linear combination of a small set of prototype matrices that are shared across components. In addition, we demonstrate the benefits of a subspace-factored extension of this model when representing independent or near-independent product densities. We present a maximum likelihood estimation algorithm for these models, as well as a practical method for implementing it. We show through experiments performed on a variety of speech recognition tasks that this model significantly outperforms a diagonal covariance model, while using far fewer Gaussian-specific parameters. Experiments also demonstrate that a better speed/accuracy tradeoff can be achieved on a real-time speech recognition system.
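The structure described in the abstract, where each inverse covariance is a linear combination of shared prototype matrices, can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, array shapes, and the choice of positive definite prototypes with positive weights (a simple way to keep every combined inverse covariance positive definite) are assumptions made for illustration only. The sketch shows why likelihood evaluation gets cheaper: the quadratic terms involving the prototypes are computed once per observation and then reused by every Gaussian through its weight vector.

```python
import numpy as np

def combine_prototypes(weights, prototypes):
    """Build one inverse covariance per Gaussian.

    weights:    (M, K) Gaussian-specific combination weights (hypothetical layout)
    prototypes: (K, D, D) symmetric prototype matrices shared by all Gaussians
    returns:    (M, D, D) inverse covariances P_m = sum_k weights[m, k] * prototypes[k]
    """
    return np.einsum("mk,kij->mij", weights, prototypes)

def log_likelihoods(x, means, weights, prototypes):
    """Log-density of x under each of the M Gaussians, sharing prototype terms."""
    d = x.shape[0]
    precisions = combine_prototypes(weights, prototypes)
    _, logdet = np.linalg.slogdet(precisions)  # per-Gaussian; can be cached
    # Terms shared by all Gaussians, computed once per observation x.
    quad_x = np.einsum("i,kij,j->k", x, prototypes, x)   # x^T Psi_k x, shape (K,)
    proj_x = np.einsum("kij,j->ki", prototypes, x)       # Psi_k x, shape (K, D)
    # (x - mu)^T P (x - mu) = x^T P x - 2 mu^T P x + mu^T P mu
    x_p_x = weights @ quad_x
    mu_p_x = np.einsum("mk,mi,ki->m", weights, means, proj_x)
    mu_p_mu = np.einsum("mi,mij,mj->m", means, precisions, means)  # cacheable
    maha = x_p_x - 2.0 * mu_p_x + mu_p_mu
    return 0.5 * (logdet - d * np.log(2.0 * np.pi) - maha)

def direct_log_likelihood(x, mean, precision):
    """Reference evaluation with an explicit full inverse covariance."""
    diff = x - mean
    _, logdet = np.linalg.slogdet(precision)
    return 0.5 * (logdet - x.shape[0] * np.log(2.0 * np.pi) - diff @ precision @ diff)
```

In this sketch each Gaussian stores only its mean and K weights rather than a full D-by-D matrix, which is the parameter saving the abstract refers to. How the prototypes and weights are actually estimated is the subject of the paper and is not reproduced here.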
Keywords :
Gaussian processes; covariance analysis; maximum likelihood estimation; speech recognition; Gaussian mixture model; acoustic modeling; automatic speech recognition; inverse covariances; maximum-likelihood estimation algorithm; parameter estimation; prototype matrices; subspace-factored extension; Covariance matrix; Gaussian processes; Inverse problems; Maximum likelihood estimation; Parameter estimation; Prototypes; Real time systems; Robustness; Scalability; Speech recognition;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2004.825675
Filename :
1288152