Title :
Variable-dimension quantization of sinusoidal amplitudes using Gaussian mixture models
Author :
Lindblom, Jonas ; Hedelin, Per
Author_Institution :
Sch. of Electr. Eng., Chalmers Univ. of Technol., Goteborg, Sweden
Abstract :
In this paper, Gaussian mixture (GM) models are used to design variable-dimension quantizers according to a weighted distortion criterion. A general method for combining a variable-to-fixed dimension transform, with GM modeling and quantization, is proposed. The method provides a convenient and efficient way to encode the amplitudes in a sinusoidal speech coder. Quantizers designed according to the proposed scheme are evaluated both according to weighted distortion criteria, and with respect to a high-rate bound approximation of the distortion. Informal listening tests suggest that the amplitudes can be encoded without subjective loss in a wideband harmonic coder, at a rate around 40 bits per frame (for the amplitudes only).
Keywords :
Gaussian distribution; quantisation (signal); speech coding; transform coding; vocoders; Gaussian mixture models; high-rate bound approximation; informal listening tests; sinusoidal amplitudes; sinusoidal speech coder; variable-dimension quantization; variable-dimension quantizers; variable-to-fixed dimension transform; weighted distortion criterion; wideband harmonic coder; Cost function; Discrete transforms; Frequency; Matrix converters; Quantization; Sampling methods; Speech; Testing; Vectors; Wideband;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1325945