• DocumentCode
    779999
  • Title

    Wideband Speech Coding Advances in VMR-WB Standard

  • Author

    Jelínek, Milan ; Salami, Redwan

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Sherbrooke Univ., Que.
  • Volume
    15
  • Issue
    4
  • fYear
    2007
  • fDate
    5/1/2007 12:00:00 AM
  • Firstpage
    1167
  • Lastpage
    1179
  • Abstract
    This paper presents novel techniques for source-controlled variable-rate wideband speech coding. These techniques have been used in the variable-rate multimode wideband (VMR-WB) speech codec recently selected by the Third-Generation Partnership Project 2 (3GPP2) for wideband (WB) speech telephony, streaming, and multimedia messaging services in the cdma2000 third-generation wireless system. The codec utilizes efficient coding modes optimized for different classes of speech signal including generic coding based on AMR-WB for transients and onsets, voiced coding optimized for stable voiced signals, unvoiced coding optimized for unvoiced segments, and comfort noise generation for inactive segments. Several innovations enable very good performance at average bit rates below 8 kb/s for active speech coding. The article presents an overview of the codec and describes in detail some of the codec novel features: Robust pitch tracking algorithm, coding-mode dependent prediction of linear prediction (LP) filter quantization, and novel frame erasure concealment techniques including supplementary information for reconstruction of lost onsets and improving decoder convergence. Selected results from the Selection and Characterization tests of the codec illustrate its performance
  • Keywords
    3G mobile communication; broadband networks; electronic messaging; filtering theory; linear predictive coding; media streaming; speech coding; voice communication; 3GPP2; Third-Generation Partnership Project 2; VMR-WB standard; cmda2000; coding-mode dependent prediction; frame erasure concealment techniques; generic coding; linear prediction filter quantization; multimedia messaging services; robust pitch tracking algorithm; source-controlled variable-rate coding; speech signal; stable voiced signals; streaming; unvoiced coding; variable-rate multimode wideband speech codec; wideband speech coding; wideband speech telephony; Message service; Multimedia systems; Noise generators; Signal generators; Speech codecs; Speech coding; Speech enhancement; Streaming media; Telephony; Wideband; Linear predictive coding; standardization; variable-rate speech coding; wideband speech coding;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.894514
  • Filename
    4156201