Title :
ITU-T EV-VBR: A robust 8-32 kbit/s scalable coder for error prone telecommunications channels
Author :
Vaillancourt, Tommy ; Jelinek, Milan ; Ertan, A. Erdem ; Stachurski, Jacek ; Ramo, Anssi ; Laaksonen, Lasse ; Gibbs, Jon ; Mittal, Udar ; Bruhn, Stefan ; Grancharov, Volodya ; Oshikiri, Masahiro ; Ehara, Hiroyuki ; Dejun Zhang ; Fuwei Ma ; Virette, David
Author_Institution :
VoiceAge/Univ. of Sherbrooke, Sherbrooke, QC, Canada
Abstract :
This paper presents ITU-T Embedded Variable Bit-Rate (EV-VBR) codec being standardized by Question 9 of Study Group 16 (Q9/16) as recommendation G.718. The codec provides a scalable solution for compression of 16 kHz sampled speech and audio signals at rates between 8 kbit/s and 32 kbit/s, robust to significant rates of frame erasures or packet losses. It comprises 5 layers where higher layer bitstreams can be discarded without affecting the lower layer decoding. The core layer takes advantage of signal-classification based CELP encoding. The second layer reduces the coding error from the first layer by means of additional pitch contribution and another algebraic codebook. The higher layers encode the weighted error signal from lower layers using MDCT transform coding. Several technologies are used to encode the MDCT coefficients for best performance both for speech and music. The codec performance is demonstrated with selected results from ITU-T Characterization test.
Keywords :
audio coding; channel coding; codecs; discrete cosine transforms; signal classification; speech coding; transform coding; variable rate codes; CELP encoding; ITU-T EV-VBR codec; ITU-T embedded variable bit-rate codec; MDCT transform coding; algebraic codebook; audio signal; bit rate 8 kbit/s to 32 kbit/s; error prone telecommunications channel; frequency 16 kHz; scalable coder; signal-classification; speech signal; Codecs; Decoding; Delays; Niobium; Speech; Speech coding;
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne