  • DocumentCode
    1066646
  • Title
    Gaussian Mixture Kalman Predictive Coding of Line Spectral Frequencies
  • Author
    Subasingha, Shaminda; Murthi, Manohar N.; Andersen, Søren Vang
  • Author_Institution
    Dept. of Electrical & Computer Engineering, University of Miami, Coral Gables, FL
  • Volume
    17
  • Issue
    2
  • fYear
    2009
  • Firstpage
    379
  • Lastpage
    391
  • Abstract
    Gaussian mixture model (GMM)-based predictive coding of line spectral frequencies (LSFs) has gained wide acceptance. In such coders, each mixture of a GMM can be interpreted as defining a linear predictive transform coder. In this paper, we use Kalman filtering principles to model each of these linear predictive transform coders to present GMM Kalman predictive coding. In particular, we show how suitable modeling of quantization noise leads to an adaptive a posteriori GMM that defines a signal-adaptive predictive coder that provides improved coding of LSFs in comparison with the baseline recursive GMM predictive coder. Moreover, we show how running the GMM Kalman predictive coders to convergence can be used to design a stationary GMM Kalman predictive coding system which again provides improved coding of LSFs but now with only a modest increase in run-time complexity over the baseline. In packet loss conditions, this stationary GMM Kalman predictive coder provides much better performance than the recursive GMM predictive coder, and in fact has comparable mean performance to a memoryless GMM coder. Finally, we illustrate how one can utilize Kalman filtering principles to design a postfilter which enhances decoded vectors from a recursive GMM predictive coder without any modifications to the encoding process.
  • Keywords
    Gaussian processes; Kalman filters; adaptive codes; convergence; spectral analysis; speech coding; vector quantisation; Kalman filtering principle; Kalman predictive transform coding; adaptive a posteriori Gaussian mixture model; line spectral frequency; postfilter; run-time complexity; signal-adaptive predictive coder; vector quantization; Filtering; Frequency; Nonlinear filters; Performance loss; Predictive coding; Predictive models; Quantization; Runtime; Gaussian mixture models (GMMs); Kalman filtering; vector quantization (VQ)
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type
    jour
  • DOI
    10.1109/TASL.2008.2008735
  • Filename
    4749461