DocumentCode
779987
Title
Fourier Transform Vector Quantization for Speech Coding
Author
Chang, Pao-Chi ; Gray, Robert M. ; May, Jack
Author_Institution
IBM Res. Center, Yorktown Heights, NY
Volume
35
Issue
10
fYear
1987
fDate
10/1/1987 12:00:00 AM
Firstpage
1059
Lastpage
1068
Abstract
Design algorithms and simulation results are presented for vector quantizers for Fourier transformed data. Transforming the data prior to quantization has two potential advantages. First, each sample in the transform domain depends on many samples in the original domain. Thus, even scalar quantization in the transform domain is a form of vector quantization or block source coding in the original waveform domain, and the basic coding theorems of information theory show that such block codes can provide better performance than scalar codes, even for memoryless sources. Second, vector quantization of Fourier transformed speech waveforms provides distinctly better subjective quality than ordinary vector quantization of the waveform using codes of comparable complexity. While the system is, of course, more complicated due to the need to take Fourier transforms, its envisioned application is as a coder for the output of FFT chips currently available or under development. The proposed implementation of a Fourier transform vector quantizer (FTVQ) uses a product code structure, providing different codes for different coefficient vectors corresponding to different frequency bands. This is a form of subband coding and yields a simple means of optimizing bit allocations among the subcodes. Two coding structures with corresponding distortion measures are considered: those that quantize vectors of pairs of real and imaginary coefficients and those that quantize separate vectors of magnitude and phase coefficients. Both structures yield good performance for the given complexity in comparison to waveform vector quantizers. For speech coding, a magnitude-phase FTVQ yields better subjective quality than a real-imaginary FTVQ when the rate allocation is properly chosen.
Keywords
Fourier transforms; Quantization; Speech coding; Algorithm design and analysis; Bit rate; Block codes; Frequency; Information theory; Product codes; Source coding; Vector quantization
fLanguage
English
Journal_Title
Communications, IEEE Transactions on
Publisher
IEEE
ISSN
0090-6778
Type
jour
DOI
10.1109/TCOM.1987.1096683
Filename
1096683