DocumentCode :
3488813
Title :
Hybrid MELP/CELP coding at bit rates from 6.4 to 2.4 kb/s
Author :
Stachurski, Jacek ; McCree, Alan ; Viswanathan, Vishu ; Heikkinen, Ari ; Ram, Anssi ; Himanen, Sakari ; Blocher, Peter
Author_Institution :
DSP Solutions R&D Center, Texas Instrum. Inc., Dallas, TX, USA
Volume :
2
fYear :
2003
fDate :
6-10 April 2003
Abstract :
This paper describes extensions of the 4 kb/s hybrid MELP/CELP coder, up to 6.4 kb/s and down to 2.4 kb/s. The baseline 4 kb/s coder uses three coding modes: MELP in strongly voiced speech frames, CELP with pitch prediction in weakly voiced frames, and CELP with stochastic excitation in unvoiced frames. To minimize switching artifacts between parametric MELP and waveform CELP coding, an alignment phase is encoded in MELP and zero-phase equalization is applied to the CELP target signal. The 6.4 kb/s extension uses the same three modes as the 4 kb/s coder, with improved MELP and CELP coders. The 2.4 kb/s extension uses only two modes: MELP for voiced frames and CELP synthesis with random excitation for unvoiced frames. The alignment phase is encoded in MELP frames for all bit rates so that time synchrony with input speech is always maintained. Alignment phase and zero-phase equalization enable smooth switching between coders at different bit rates. The hybrid MELP/CELP coding structure leads to coders that perform better at a given bit rate than MELP or CELP separately, and better than or equivalent to higher bit-rate ITU standards. Formal subjective tests show that for all-but-one tested conditions, the 6.4 kb/s hybrid coder is better than 8 kb/s G.729 and the 2.4 kb/s coder is equivalent to, or better than, 6.4 kb/s G.729 Annex D.
Keywords :
data compression; linear predictive coding; speech coding; vocoders; 2.4 kbit/s; 4 kbit/s; 6.4 kbit/s; 8 kbit/s; CELP synthesis; CELP target signal; G.729 Annex D; ITU standards; alignment phase; bit rates; coding modes; formal subjective tests; hybrid MELP/CELP coding; input speech; parametric MELP coding; pitch prediction; random excitation; stochastic excitation; switching artifacts minimization; time synchrony; unvoiced frames; voiced speech frames; waveform CELP coding; zero-phase equalization; Bit rate; Code standards; Digital signal processing; Ear; Encoding; Instruments; Research and development; Speech coding; Stochastic processes; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1202317
Filename :
1202317
Link To Document :
بازگشت