DocumentCode :
730803
Title :
Arithmetic coding of speech and audio spectra using TCX based on linear predictive spectral envelopes
Author :
Backstrom, Tom ; Helmrich, Christian R.
Author_Institution :
Int. Audio Labs. Erlangen, Friedrich-Alexander Univ. (FAU), Erlangen, Germany
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
5127
Lastpage :
5131
Abstract :
Unified speech and audio codecs often use a frequency-domain coding technique of the transform coded excitation (TCX) type. It is based on modeling the speech source with a linear predictor, spectral weighting by a perceptual model, and entropy coding of the frequency components. While previous approaches have used neighbouring frequency components to form a probability model for the entropy coder of spectral components, we propose to use the magnitude of the linear predictor to estimate the variance of spectral components. Since the linear predictor is transmitted in any case, this method does not require any additional side information. Subjective measurements show that the proposed method gives a statistically significant improvement in perceptual quality when the bit rate is held constant. Consequently, the proposed method has been adopted in the 3GPP Enhanced Voice Services speech coding standard.
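The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the probability model, so the Laplacian distribution, the mapping from envelope magnitude to distribution scale, and the names `lpc_envelope`, `laplace_cdf` and `estimated_bits` are all assumptions made for illustration. The sketch evaluates the magnitude of the linear predictor's envelope |1/A(z)| at each frequency bin and uses it as the per-bin scale of the model that an entropy coder would consume, so no side information beyond the predictor coefficients is needed.

```python
import cmath
import math


def lpc_envelope(a, n_bins):
    """Magnitude of 1/A(z) at n_bins frequencies spread over (0, pi).

    `a` holds the predictor polynomial coefficients [1, a1, a2, ...].
    """
    env = []
    for k in range(n_bins):
        w = math.pi * (k + 0.5) / n_bins
        # A(e^{jw}) = sum_m a[m] * e^{-jwm}
        a_w = sum(c * cmath.exp(-1j * w * m) for m, c in enumerate(a))
        env.append(1.0 / abs(a_w))
    return env


def laplace_cdf(x, scale):
    """CDF of a zero-mean Laplacian (assumed model, not from the source)."""
    if x < 0:
        return 0.5 * math.exp(x / scale)
    return 1.0 - 0.5 * math.exp(-x / scale)


def estimated_bits(q, scales):
    """Ideal code length in bits of integers q under per-bin Laplacian models."""
    bits = 0.0
    for value, s in zip(q, scales):
        # Probability mass of the quantization cell around `value`.
        p = laplace_cdf(value + 0.5, s) - laplace_cdf(value - 0.5, s)
        bits -= math.log2(max(p, 1e-12))  # floor avoids log2(0)
    return bits


# Hypothetical first-order predictor with a low-pass envelope.
a = [1.0, -0.9]
env = lpc_envelope(a, n_bins=8)

# Hypothetical quantized spectrum that roughly follows the envelope.
q = [5, -2, 1, 1, 0, -1, 0, 0]
print("bits with envelope-derived scales:", estimated_bits(q, env))
```

Under these assumptions, a bin with a large envelope value is assigned a wide distribution, so large quantized values there cost fewer bits, while bins with a small envelope value make zeros cheap. A real arithmetic coder would convert the same per-bin probabilities into integer cumulative frequency tables.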
Keywords :
arithmetic codes; audio coding; entropy codes; frequency-domain analysis; speech codecs; speech coding; statistical analysis; transform coding; 3GPP enhanced voice service speech coding standard; TCX; audio spectra arithmetic coding; frequency component entropy coding; frequency domain coding technique; linear predictive spectral envelope; linear predictor; perceptual model; perceptual quality; probability model; spectral component entropy coder; spectral weighting; speech arithmetic coding; speech source model; statistical bit rate; transform coded excitation; unified audio codec; unified speech codec; Codecs; Predictive models; Speech; Speech coding; Standards; Transform coding; arithmetic coding; frequency domain coding; speech and audio coding
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178948
Filename :
7178948