DocumentCode :
2020123
Title :
High-quality speech coding at 2.4 to 4.0 kbit/s based on time-frequency interpolation
Author :
Shoham, Yair
Author_Institution :
AT&T Bell Lab., Murray Hill, NJ, USA
Volume :
2
fYear :
1993
fDate :
27-30 April 1993
Firstpage :
167
Abstract :
The author presents a novel algorithm for high-quality coding and demonstrates the advantage of the proposed coder over the conventional CELP (code-excited linear prediction) algorithm for low-rate coding. He proposes an empirical but perceptually advantageous framework for voiced speech processing, called time-frequency interpolation (TFI). The general formulation of the TFI technique is given and then a TFI speech coder is described. The performance of this coder at 4.05 and 2.5 kbit/s is demonstrated in terms of formal mean opinion scores (MOS). It is shown that the 4.05 kbit/s TFI coder is comparable in performance with the 8 kbit/s European standard GSM (Group Special Mobile) coder. It is also shown that reducing the bit rate to 2.50 kbit/s degrades the performance only gracefully, and the coder still delivers good-quality speech at this rate.
Keywords :
interpolation; speech coding; time-frequency analysis; vocoders; 2.4 to 4.0 kbit/s; GSM; algorithm; code-excited linear prediction; mean opinion score; performance; time-frequency interpolation; voice speech processing
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.1993.319260
Filename :
319260