Title :
PDF optimized parametric vector quantization with application to speech coding
Author :
Subramaniam, Anand D. ; Rao, Bhaskar D.
Author_Institution :
California Univ., San Diego, La Jolla, CA, USA
fDate :
Oct. 29 2000-Nov. 1 2000
Abstract :
A computationally efficient, high quality vector quantization scheme based on a parametric probability density function (PDF) is proposed. In this scheme, the observations are modeled as i.i.d realizations of a multivariate mixture density. The mixture model parameters are efficiently estimated using the expectation maximization (EM) algorithm. The estimated density is suitably quantized using transform coding and bit allocation techniques for both fixed rate and variable rate systems. The usefulness of the approach is demonstrated for speech coding where Gaussian mixture models are used to model speech line spectral frequencies. An attractive feature of this method is that source encoding using the resultant codebook involves no searches and its computational complexity is minimal and independent of the rate of the system. Furthermore, the proposed scheme is scalable and can switch between memoryless quantizer and quantizer with memory seamlessly. The quantizer with memory is shown to provide transparent quality speech at 16 bits/frame.
Keywords :
Gaussian processes; computational complexity; memoryless systems; optimisation; probability; source coding; spectral analysis; speech coding; speech intelligibility; transform coding; vector quantisation; Gaussian mixture models; PDF optimized parametric VQ; bit allocation; codebook; computational complexity; expectation maximization algorithm; fixed rate systems; i.i.d realizations; memoryless quantizer; mixture model parameters; multivariate mixture density; parametric probability density function; quantizer memory; source encoding; speech coding; speech line spectral frequencies; transform coding; transparent quality speech; variable rate systems; vector quantization; Bit rate; Computational complexity; Encoding; Frequency; Parameter estimation; Probability density function; Speech coding; Switches; Transform coding; Vector quantization;
Conference_Titel :
Signals, Systems and Computers, 2000. Conference Record of the Thirty-Fourth Asilomar Conference on
Conference_Location :
Pacific Grove, CA, USA
Print_ISBN :
0-7803-6514-3
DOI :
10.1109/ACSSC.2000.911235