DocumentCode
787259
Title
Predictive trellis coded quantization of speech
Author
Marcellin, Michael W. ; Fischer, Thomas R. ; Gibson, Jerry D.
Author_Institution
Dept. of Electr. & Comput. Eng., Arizona Univ., Tucson, AZ, USA
Volume
38
Issue
1
fYear
1990
fDate
1/1/1990 12:00:00 AM
Firstpage
46
Lastpage
55
Abstract
Trellis-coded quantization (TCQ) is incorporated into a predictive coding structure for encoding sampled speech. The modest complexity of the resulting structure is seen to be a direct consequence of the TCQ formulation. Simulation results are presented for systems using fixed-prediction/fixed-residual encoding, fixed-prediction/adaptive-residual encoding, and adaptive-prediction/adaptive-residual encoding. The performance of predictive TCQ (PTCQ) is compared to that of other waveform coders, and the effects of channel errors on PTCQ performance are discussed. For a fully adaptive 16-kb/s speech coding system, segmental signal-to-noise ratios in the range of 19.1-21.9 dB are obtained for a variety of speakers and test sentences. Reconstructed speech obtained from this system is of excellent communication quality.
Keywords
encoding; speech analysis and processing; 16 kbit/s; 19.1 to 21.9 dB; adaptive-prediction/adaptive-residual encoding; channel error effects; communication quality; fixed-prediction/adaptive-residual encoding; fixed-prediction/fixed-residual encoding; predictive trellis coded quantization; reconstructed speech; sampled speech encoding; segmental signal-to-noise ratios; speakers; test sentences; Encoding; History; Modulation coding; Predictive coding; Predictive models; Quantization; Rate-distortion; Signal to noise ratio; Speech coding; System testing
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
IEEE
ISSN
0096-3518
Type
jour
DOI
10.1109/29.45617
Filename
45617