• DocumentCode
    2963434
  • Title

    PDF optimized parametric vector quantization with application to speech coding

  • Author

    Subramaniam, Anand D. ; Rao, Bhaskar D.

  • Author_Institution
    California Univ., San Diego, La Jolla, CA, USA
  • Volume
    2
  • fYear
    2000
  • fDate
    Oct. 29 2000-Nov. 1 2000
  • Firstpage
    1475
  • Abstract
    A computationally efficient, high quality vector quantization scheme based on a parametric probability density function (PDF) is proposed. In this scheme, the observations are modeled as i.i.d realizations of a multivariate mixture density. The mixture model parameters are efficiently estimated using the expectation maximization (EM) algorithm. The estimated density is suitably quantized using transform coding and bit allocation techniques for both fixed rate and variable rate systems. The usefulness of the approach is demonstrated for speech coding where Gaussian mixture models are used to model speech line spectral frequencies. An attractive feature of this method is that source encoding using the resultant codebook involves no searches and its computational complexity is minimal and independent of the rate of the system. Furthermore, the proposed scheme is scalable and can switch between memoryless quantizer and quantizer with memory seamlessly. The quantizer with memory is shown to provide transparent quality speech at 16 bits/frame.
  • Keywords
    Gaussian processes; computational complexity; memoryless systems; optimisation; probability; source coding; spectral analysis; speech coding; speech intelligibility; transform coding; vector quantisation; Gaussian mixture models; PDF optimized parametric VQ; bit allocation; codebook; computational complexity; expectation maximization algorithm; fixed rate systems; i.i.d realizations; memoryless quantizer; mixture model parameters; multivariate mixture density; parametric probability density function; quantizer memory; source encoding; speech coding; speech line spectral frequencies; transform coding; transparent quality speech; variable rate systems; vector quantization; Bit rate; Computational complexity; Encoding; Frequency; Parameter estimation; Probability density function; Speech coding; Switches; Transform coding; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals, Systems and Computers, 2000. Conference Record of the Thirty-Fourth Asilomar Conference on
  • Conference_Location
    Pacific Grove, CA, USA
  • ISSN
    1058-6393
  • Print_ISBN
    0-7803-6514-3
  • Type

    conf

  • DOI
    10.1109/ACSSC.2000.911235
  • Filename
    911235