• DocumentCode
    700191
  • Title

    ITU-T EV-VBR: A robust 8-32 kbit/s scalable coder for error prone telecommunications channels

  • Author

    Vaillancourt, Tommy ; Jelinek, Milan ; Ertan, A. Erdem ; Stachurski, Jacek ; Ramo, Anssi ; Laaksonen, Lasse ; Gibbs, Jon ; Mittal, Udar ; Bruhn, Stefan ; Grancharov, Volodya ; Oshikiri, Masahiro ; Ehara, Hiroyuki ; Dejun Zhang ; Fuwei Ma ; Virette, David

  • Author_Institution
    VoiceAge/Univ. of Sherbrooke, Sherbrooke, QC, Canada
  • fYear
    2008
  • fDate
    25-29 Aug. 2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This paper presents ITU-T Embedded Variable Bit-Rate (EV-VBR) codec being standardized by Question 9 of Study Group 16 (Q9/16) as recommendation G.718. The codec provides a scalable solution for compression of 16 kHz sampled speech and audio signals at rates between 8 kbit/s and 32 kbit/s, robust to significant rates of frame erasures or packet losses. It comprises 5 layers where higher layer bitstreams can be discarded without affecting the lower layer decoding. The core layer takes advantage of signal-classification based CELP encoding. The second layer reduces the coding error from the first layer by means of additional pitch contribution and another algebraic codebook. The higher layers encode the weighted error signal from lower layers using MDCT transform coding. Several technologies are used to encode the MDCT coefficients for best performance both for speech and music. The codec performance is demonstrated with selected results from ITU-T Characterization test.
  • Keywords
    audio coding; channel coding; codecs; discrete cosine transforms; signal classification; speech coding; transform coding; variable rate codes; CELP encoding; ITU-T EV-VBR codec; ITU-T embedded variable bit-rate codec; MDCT transform coding; algebraic codebook; audio signal; bit rate 8 kbit/s to 32 kbit/s; error prone telecommunications channel; frequency 16 kHz; scalable coder; signal-classification; speech signal; Codecs; Decoding; Delays; Niobium; Speech; Speech coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2008 16th European
  • Conference_Location
    Lausanne
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7080723