DocumentCode
1652530
Title
A new bandwidth extension technology for MPEG Unified Speech and Audio Coding
Author
Yamamoto, Yusaku ; Chinen, Toru ; Nishiguchi, Masayuki
Author_Institution
Sony Corp., Tokyo, Japan
fYear
2013
Firstpage
523
Lastpage
527
Abstract
In January 2012, MPEG finalized the new MPEG-D Unified Speech and Audio Coding (USAC) standard, which enables the coding of a variety of audio content at low bitrates. USAC provides low-bitrate coding by integrating a speech codec and an audio codec into a unified system. In USAC, Predictive Vector Coding (PVC) is added to Enhanced Spectral Band Replication (eSBR) to improve the subjective quality, especially for speech at low bitrates. For speech signals, there is generally a relatively high correlation between the spectral envelopes of low- and high-frequency bands. The PVC scheme exploits this by predicting the high-frequency envelopes from the low-frequency ones, with the coefficient matrices for the prediction being coded by means of vector quantization.
Keywords
audio coding; speech codecs; speech coding; vector quantisation; video coding; MPEG-D; PVC; USAC standard; audio codec; bandwidth extension technology; eSBR; enhanced spectral band replication; low-bitrate coding; predictive vector coding; speech codec; speech signals; unified speech and audio coding; vector quantization; Bit rate; Hafnium; MONOS devices; Multiple signal classification; Speech; Speech coding; Vectors; Bandwidth extension; Predictive Vector Coding (PVC); Unified Speech and Audio Coding (USAC);
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6637702
Filename
6637702
Link To Document