DocumentCode
1280115
Title
A low delay 16 kb/s speech coder
Author
Iyengar, Vasu ; Kabal, Peter
Author_Institution
Dept. of Electr. Eng., McGill Univ., Montreal, Que., Canada
Volume
39
Issue
5
fYear
1991
fDate
5/1/1991 12:00:00 AM
Firstpage
1049
Lastpage
1057
Abstract
A code tree generated by a stochastically populated innovations tree with a backward adaptive gain and backward adaptive synthesis filters is considered. The synthesis configuration uses a cascade of two all-pole filters: a pitch (long time delay) filter followed by a formant (short time delay) filter. Both filters are updated using backward adaptation. The formant predictor is updated using an adaptive lattice algorithm. The multipath (M , L ) search algorithm is used to encode the speech. A frequency-weighted error measure is used to reduce the perceptual loudness of the quantization noise. The addition of the pitch filter gives 2-10-dB increase in segSNR (segmental signal-to-noise ratio) in the voiced segments. Subjective testing has shown that the coder attains a subjective quality equivalent to 7 b/sample log-PCM (pulse code modulation) with an encoding delay of eight samples (1 ms with an 8-kHz sampling rate)
Keywords
adaptive filters; encoding; filtering and prediction theory; speech analysis and processing; trees (mathematics); 16 kbit/s; adaptive lattice algorithm; all-pole filters; backward adaptive gain; backward adaptive synthesis filters; code tree; formant filter; frequency-weighted error measure; low delay speech coder; multipath search algorithm; perceptual loudness; pitch filter; quantization noise; segmental signal-to-noise ratio; stochastically populated innovations tree; subjective quality; Adaptive filters; Delay effects; Frequency measurement; Lattices; Noise measurement; Noise reduction; Pulse modulation; Quantization; Speech synthesis; Technological innovation;
fLanguage
English
Journal_Title
Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
1053-587X
Type
jour
DOI
10.1109/78.80962
Filename
80962
Link To Document