Predictive trellis coded quantization of speech

Author

Marcellin, Michael W. ; Fischer, Thomas R. ; Gibson, Jerry D.

Author_Institution

Dept. of Electr. & Comput. Eng., Arizona Univ., Tucson, AZ, USA

Volume

38

Issue

1

fYear

1990

fDate

1/1/1990 12:00:00 AM

Firstpage

46

Lastpage

55

Abstract

Trellis-coded quantization (TCQ) is incorporated into a predictive coding structure for encoding sampled speech. The modest complexity of the resulting structure is seen to be a direct consequence of the TCQ formulation. Simulation results are presented for systems using fixed-prediction/fixed-residual encoding, fixed-prediction/adaptive-residual encoding, and adaptive-prediction/adaptive-residual encoding. The performance of predictive TCQ (PTCQ) is compared to that of other waveform coders, and the effects of channel errors on PTCQ performance are discussed. For a fully adaptive 16-kb/s speech coding system, segmental signal-to-noise ratios in the range of 19.1-21.9 dB are obtained for a variety of speakers and test sentences. Reconstructed speech obtained from this system is of excellent communication quality.

Keywords

encoding; speech analysis and processing; 16 kbit/s; 19.1 to 21.9 dB; adaptive-prediction/adaptive-residual encoding; channel error effects; communication quality; fixed-prediction/adaptive-residual encoding; fixed-prediction/fixed-residual encoding; predictive trellis coded quantization; reconstructed speech; sampled speech encoding; segmental signal-to-noise ratios; speakers; test sentences; Encoding; History; Modulation coding; Predictive coding; Predictive models; Quantization; Rate-distortion; Signal to noise ratio; Speech coding; System testing;

fLanguage

English

Journal_Title

Acoustics, Speech and Signal Processing, IEEE Transactions on

Publisher

IEEE

ISSN

0096-3518

Type

jour

DOI

10.1109/29.45617

Filename

45617