DocumentCode :
835819
Title :
Low-complexity source coding using Gaussian mixture models, lattice vector quantization, and recursive coding with application to speech spectrum quantization
Author :
Subramaniam, Anand D. ; Gardner, William R. ; Rao, Bhaskar D.
Volume :
14
Issue :
2
fYear :
2006
fDate :
3/1/2006 12:00:00 AM
Firstpage :
524
Lastpage :
532
Abstract :
In this paper, we use the Gaussian mixture model (GMM) based multidimensional companding quantization framework to develop two important quantization schemes. In the first scheme, the scalar quantization in the companding framework is replaced by more efficient lattice vector quantization. Low-complexity lattice pruning and quantization schemes are provided for the E8 Gossett lattice. At moderate to high bit rates, the proposed scheme recovers much of the space-filling loss due to the product vector quantizers (PVQ) employed in earlier work, and thereby, provides improved performance with a marginal increase in complexity. In the second scheme, we generalize the compression framework to accommodate recursive coding. In this approach, the joint probability density function (PDF) of the parameter vectors of successive source frames is modeled using a GMM. The conditional density of the parameter vector of the current source frame based on the quantized values of the parameter vector of the previous source frames is used to generate a new codebook for every current source frame. We demonstrate the efficacy of the proposed schemes in the application of speech spectrum quantization. The proposed scheme is shown to provide superior performance with moderate increase in complexity when compared with conventional one-step linear prediction based compression schemes for both narrow-band and wide-band speech.
Keywords :
Gaussian processes; computational complexity; data compression; recursive estimation; speech coding; vector quantisation; Gaussian mixture models; joint probability density function; lattice vector quantization; linear prediction coding; low-complexity source coding; multidimensional companding quantization framework; narrowband speech; product vector quantizers; recursive coding; speech spectrum quantization; wideband speech; Bit rate; Lattices; Multidimensional systems; Narrowband; Performance loss; Probability density function; Source coding; Speech coding; Vector quantization; Wideband; Gaussian mixture models; lattice vector quantization; memoryless coders; multidimensional companding; recursive coding; speech spectrum quantization; vector quantization; wide-band speech;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TSA.2005.855839
Filename :
1597257
Link To Document :
بازگشت