Title :
PDF optimized parametric vector quantization of speech line spectral frequencies
Author :
Subramaniam, Anand D. ; Rao, Bhaskar D.
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., San Diego, La Jolla, CA, USA
Abstract :
A computationally efficient, high quality, vector quantization scheme based on a parametric probability density function (PDF) is developed for encoding speech line spectral frequencies (LSF). For this purpose, speech LSFs are modeled as i.i.d realizations of a multivariate normal mixture density. The mixture model parameters are efficiently estimated from the training data using the expectation maximization (EM) algorithm. The estimated density is suitably quantized using transform coding and bit-allocation techniques for both fixed rate and variable rate systems. Source encoding using the resultant codebook involves no searches and its computational complexity is minimal and independent of the rate of the system. Experimental results show that the proposed scheme provides 2-3 bits gain over conventional MSVQ schemes. The proposed memoryless quantizer is enhanced to form a quantizer with memory. The quantizer with memory provides transparent quality speech at 20 bits/frame
Keywords :
computational complexity; memoryless systems; optimisation; parameter estimation; probability; source coding; spectral analysis; speech coding; speech intelligibility; transform coding; vector quantisation; EM algorithm; PDF optimized parametric VQ; bit-allocation; codebook; computational complexity; computationally efficient method; expectation maximization algorithm; experimental results; fixed rate system; high quality scheme; i.i.d realizations; memory; memoryless quantizer; mixture model parameter estimation; multivariate normal mixture density; parametric probability density function; source encoding; speech coding; speech line spectral frequencies; speech quantization; training data; transform coding; transparent quality speech; variable rate system; vector quantization; Electronic mail; Encoding; Frequency; Multidimensional systems; Parametric statistics; Probability density function; Speech; Training data; Transform coding; Vector quantization;
Conference_Titel :
Speech Coding, 2000. Proceedings. 2000 IEEE Workshop on
Conference_Location :
Delavan, WI
Print_ISBN :
0-7803-6416-3
DOI :
10.1109/SCFT.2000.878407