DocumentCode
2963434
Title
PDF optimized parametric vector quantization with application to speech coding
Author
Subramaniam, Anand D. ; Rao, Bhaskar D.
Author_Institution
California Univ., San Diego, La Jolla, CA, USA
Volume
2
fYear
2000
fDate
Oct. 29 2000-Nov. 1 2000
Firstpage
1475
Abstract
A computationally efficient, high quality vector quantization scheme based on a parametric probability density function (PDF) is proposed. In this scheme, the observations are modeled as i.i.d realizations of a multivariate mixture density. The mixture model parameters are efficiently estimated using the expectation maximization (EM) algorithm. The estimated density is suitably quantized using transform coding and bit allocation techniques for both fixed rate and variable rate systems. The usefulness of the approach is demonstrated for speech coding where Gaussian mixture models are used to model speech line spectral frequencies. An attractive feature of this method is that source encoding using the resultant codebook involves no searches and its computational complexity is minimal and independent of the rate of the system. Furthermore, the proposed scheme is scalable and can switch between memoryless quantizer and quantizer with memory seamlessly. The quantizer with memory is shown to provide transparent quality speech at 16 bits/frame.
Keywords
Gaussian processes; computational complexity; memoryless systems; optimisation; probability; source coding; spectral analysis; speech coding; speech intelligibility; transform coding; vector quantisation; Gaussian mixture models; PDF optimized parametric VQ; bit allocation; codebook; computational complexity; expectation maximization algorithm; fixed rate systems; i.i.d realizations; memoryless quantizer; mixture model parameters; multivariate mixture density; parametric probability density function; quantizer memory; source encoding; speech coding; speech line spectral frequencies; transform coding; transparent quality speech; variable rate systems; vector quantization; Bit rate; Computational complexity; Encoding; Frequency; Parameter estimation; Probability density function; Speech coding; Switches; Transform coding; Vector quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Signals, Systems and Computers, 2000. Conference Record of the Thirty-Fourth Asilomar Conference on
Conference_Location
Pacific Grove, CA, USA
ISSN
1058-6393
Print_ISBN
0-7803-6514-3
Type
conf
DOI
10.1109/ACSSC.2000.911235
Filename
911235
Link To Document