Title :
Mixtures of inverse covariances
Author :
Vanhoucke, Vincent ; Sankar, Ananth
Author_Institution :
Speech R&D Group, Menlo Park, CA, USA
fDate :
5/1/2004 12:00:00 AM
Abstract :
We describe a model which approximates full covariances in a Gaussian mixture while reducing significantly both the number of parameters to estimate and the computations required to evaluate the Gaussian likelihoods. In this model, the inverse covariance of each Gaussian in the mixture is expressed as a linear combination of a small set of prototype matrices that are shared across components. In addition, we demonstrate the benefits of a subspace-factored extension of this model when representing independent or near-independent product densities. We present a maximum likelihood estimation algorithm for these models, as well as a practical method for implementing it. We show through experiments performed on a variety of speech recognition tasks that this model significantly outperforms a diagonal covariance model, while using far fewer Gaussian-specific parameters. Experiments also demonstrate that a better speed/accuracy tradeoff can be achieved on a real-time speech recognition system.
Keywords :
Gaussian processes; covariance analysis; maximum likelihood estimation; speech recognition; Gaussian mixture model; acoustic modeling; automatic speech recognition; inverse covariances; maximum-likelihood estimation algorithm; parameter estimation; prototype matrices; subspace-factored extension; Covariance matrix; Gaussian processes; Inverse problems; Maximum likelihood estimation; Parameter estimation; Prototypes; Real time systems; Robustness; Scalability; Speech recognition;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
DOI :
10.1109/TSA.2004.825675