DocumentCode :
1701713
Title :
Speech spectrum quantization using Gaussian mixture models and multi-dimensional companding
Author :
Subramaniam, Anand D. ; Gardne, William R. ; Rao, Bhaskar D.
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., San Diego, La Jolla, CA, USA
fYear :
2002
Firstpage :
5
Lastpage :
7
Abstract :
A low complexity, high quality, quantization scheme using Gaussian mixture models and multi-dimensional companding is presented for speech spectrum quantization. As in past work (Subramaniam and Rao 2000), the probability density function (pdf) of the source is modeled as a Gaussian mixture model whose parameters are estimated using the EM algorithm. The novelty of the approach lies in the development of a more efficient mapping from the density to the codebook vectors. The individual Gaussian component densities are quantized using an optimal multidimensional compressor function followed by a lattice vector quantizer (LVQ). For the purpose of speech spectrum quantization, the E8 lattice is used for lattice quantization. Low complexity encoding and index assignment algorithms are presented and the performance is compared and shown to be better than that of PDF optimized parametric VQ.
Keywords :
Gaussian processes; iterative methods; speech coding; vector quantisation; EM algorithm; Gaussian mixture models; LVQ; codebook vectors; lattice vector quantizer; low complexity high quality quantization scheme; multi-dimensional companding; probability density function; speech spectrum quantization; Clustering algorithms; Encoding; Frequency; Lattices; Multidimensional systems; Parameter estimation; Probability density function; Quantization; Source coding; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech Coding, 2002, IEEE Workshop Proceedings.
Print_ISBN :
0-7803-7549-1
Type :
conf
DOI :
10.1109/SCW.2002.1215705
Filename :
1215705
Link To Document :
بازگشت