DocumentCode :
417131
Title :
Variable-dimension quantization of sinusoidal amplitudes using Gaussian mixture models
Author :
Lindblom, Jonas ; Hedelin, Per
Author_Institution :
Sch. of Electr. Eng., Chalmers Univ. of Technol., Goteborg, Sweden
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
In this paper, Gaussian mixture (GM) models are used to design variable-dimension quantizers according to a weighted distortion criterion. A general method for combining a variable-to-fixed dimension transform with GM modeling and quantization is proposed. The method provides a convenient and efficient way to encode the amplitudes in a sinusoidal speech coder. Quantizers designed according to the proposed scheme are evaluated both according to weighted distortion criteria and with respect to a high-rate bound approximation of the distortion. Informal listening tests suggest that the amplitudes can be encoded without subjective loss in a wideband harmonic coder, at a rate around 40 bits per frame (for the amplitudes only).
Keywords :
Gaussian distribution; quantisation (signal); speech coding; transform coding; vocoders; Gaussian mixture models; high-rate bound approximation; informal listening tests; sinusoidal amplitudes; sinusoidal speech coder; variable-dimension quantization; variable-dimension quantizers; variable-to-fixed dimension transform; weighted distortion criterion; wideband harmonic coder; Cost function; Discrete transforms; Frequency; Matrix converters; Quantization; Sampling methods; Speech; Testing; Vectors; Wideband;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1325945
Filename :
1325945