DocumentCode
1118064
Title
A High-Rate Optimal Transform Coder With Gaussian Mixture Companders
Author
Duni, Ethan R. ; Rao, Bhaskar D.
Author_Institution
Dept. of Electr. & Comput. Eng., California Univ., San Diego, La Jolla, CA
Volume
15
Issue
3
fYear
2007
fDate
3/1/2007 12:00:00 AM
Firstpage
770
Lastpage
783
Abstract
This paper examines the problem of designing fixed-rate transform coders for sources whose distributions are unknown and presumably non-Gaussian, under input-weighted squared error distortion measures. As a component of this system, a flexible scalar compander based on Gaussian mixtures is proposed. The high-rate analysis of transform coders is reviewed, and extended to the case of input-weighted squared error. An algorithm is developed to set the parameters of the system using a data-driven technique that automatically balances the source statistics, distortion measure, and structure of the transform coder to minimize the high-rate distortion. The implementation of Gaussian mixture companders is explored, resulting in a flexible, low-complexity scalar quantizer. Additionally, modifications to the system for operation at moderate rates, using unstructured scalar quantizers, are presented. The operation of the system for the problem of wideband speech line spectral frequencies (LSF) quantization with log spectral distortion is illustrated, and shown to provide good performance with very low complexity
Keywords
Gaussian processes; compandors; speech coding; transform coding; Gaussian mixture companders; fixed-rate transform coders; flexible low-complexity scalar quantizer; flexible scalar compander; high-rate optimal transform coder; input-weighted squared error distortion measures; log spectral distortion; source statistics; wideband speech line spectral frequencies quantization; Distortion measurement; Frequency; Karhunen-Loeve transforms; Nonlinear distortion; Parameter estimation; Quantization; Speech; Statistics; Transform coding; Wideband; Compander; high-rate quantization; log spectral distortion (LSD); parameter estimation; transform coder; wideband speech;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2006.885905
Filename
4100676
Link To Document