• DocumentCode
    1652530
  • Title
    A new bandwidth extension technology for MPEG Unified Speech and Audio Coding
  • Author
    Yamamoto, Yusaku ; Chinen, Toru ; Nishiguchi, Masayuki
  • Author_Institution
    Sony Corp., Tokyo, Japan
  • fYear
    2013
  • Firstpage
    523
  • Lastpage
    527
  • Abstract
    In January 2012, MPEG finalized the new MPEG-D Unified Speech and Audio Coding (USAC) standard, which enables the coding of a variety of audio content at low bitrates. USAC provides low-bitrate coding by integrating a speech codec and an audio codec into a unified system. In USAC, Predictive Vector Coding (PVC) is added to Enhanced Spectral Band Replication (eSBR) to improve the subjective quality, especially for speech at low bitrates. For speech signals, there is generally a relatively high correlation between the spectral envelopes of low- and high-frequency bands. The PVC scheme exploits this by predicting the high-frequency envelopes from the low-frequency ones, with the coefficient matrices for the prediction being coded by means of vector quantization.
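
The prediction idea described in the abstract can be sketched roughly as follows. This is a toy illustration under stated assumptions, not the procedure defined in the USAC standard: the function names, the two-entry codebook, the band counts, and the squared-error selection rule are all hypothetical. The encoder picks the index of the coefficient matrix that best predicts the high-band envelope from the low-band envelope, and the decoder rebuilds the high-band envelope from that index alone.

```python
# Illustrative sketch only: the codebook, dimensions, and function names are
# hypothetical and are not taken from the USAC specification.

def predict_high(matrix, low_env):
    """Predict each high-band envelope value as a weighted sum of low-band values."""
    return [sum(w * x for w, x in zip(row, low_env)) for row in matrix]

def squared_error(a, b):
    """Sum of squared differences between two envelopes of equal length."""
    return sum((p - q) ** 2 for p, q in zip(a, b))

def encode(codebook, low_env, high_env):
    """Encoder side: return the index of the matrix whose prediction is closest."""
    errors = [squared_error(predict_high(m, low_env), high_env) for m in codebook]
    return min(range(len(codebook)), key=errors.__getitem__)

def decode(codebook, index, low_env):
    """Decoder side: rebuild the high-band envelope from the index and the low band."""
    return predict_high(codebook[index], low_env)

# Toy codebook of two 2x3 matrices (2 high bands predicted from 3 low bands).
CODEBOOK = [
    [[0.5, 0.5, 0.0], [0.0, 0.5, 0.5]],
    [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]],
]

low = [4.0, 2.0, 6.0]   # assumed low-band envelope values
high = [4.0, 6.0]       # assumed true high-band envelope, known only to the encoder

idx = encode(CODEBOOK, low, high)
reconstructed = decode(CODEBOOK, idx, low)
```

Only `idx` would need to be transmitted in this sketch, which is why exploiting the correlation between low- and high-band envelopes can save bits.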
  • Keywords
    audio coding; speech codecs; speech coding; vector quantisation; video coding; MPEG-D; PVC; USAC standard; audio codec; bandwidth extension technology; eSBR; enhanced spectral band replication; low-bitrate coding; predictive vector coding; speech codec; speech signals; unified speech and audio coding; vector quantization; Bit rate; Hafnium; MONOS devices; Multiple signal classification; Speech; Speech coding; Vectors; Bandwidth extension; Predictive Vector Coding (PVC); Unified Speech and Audio Coding (USAC)
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2013.6637702
  • Filename
    6637702