Title :
Gaussian-mixture modeling of lattice-based spherical vector quantization performance in transform audio coding
Author :
Patchoo, Wisarn ; Fischer, Thomas R.
Author_Institution :
Sch. of EECS, Washington State Univ., Pullman, WA, USA
Abstract :
A block-based Gaussian mixture model (GMM) is used to model the distribution of transform audio data to be encoded using lattice-based spherical vector quantization (LSVQ). The expectation-maximization algorithm is used to design the GMM to model the marginal density of the transform coefficients and the vector energy density. A GMM-based rate-distortion function is derived and shown to closely match the observed spherical VQ performance. The LSVQ transform audio coding performance is characterized for the best lattices known in 4, 8, 16, and 32 dimensions.
Keywords :
Gaussian processes; audio coding; expectation-maximisation algorithm; quantisation (signal); transform coding; block based Gaussian mixture model; expectation-maximization algorithm; lattice based spherical vector quantization; transform audio coding; Audio coding; Gaussian processes; Vector quantization; Gaussian distributions; audio coding; rate distortion theory; vector quantization;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495823