DocumentCode :
1728767
Title :
Rate distortion performance bounds for wideband speech
Author :
Gibson, Jerry D. ; Li, Ying-Yi
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, Santa Barbara, CA, USA
fYear :
2012
Firstpage :
186
Lastpage :
191
Abstract :
We develop new rate distortion bounds for wideband speech sources based on phonetically-motivated composite source models, conditional rate distortion theory, and perceptual wideband PESQ (WPESQ) distortion measures. The approach is to calculate rate distortion bounds for MSE distortion for each subsource of the composite source model and use conditional rate distortion theory to calculate the MSE R(D) for the composite source. Since MSE is not a useful distortion measure for today's best-performing voice codecs, we generate a mapping of MSE-to-WPESQ using fully backward adaptive waveform coders, which have MSE distortion values that correctly order their performance, and for which WPESQ values can be generated. We generate the final rate distortion functions with the mapping and show that our new rate distortion curves lower bound the performance of the best known standardized wideband speech codecs.
Keywords :
rate distortion theory; speech codecs; waveform analysis; MSE distortion; WPESQ distortion measures; composite source model; conditional rate distortion theory; fully backward adaptive waveform coder; perceptual wideband PESQ distortion measures; phonetically-motivated composite source model; rate distortion curves; rate distortion performance bounds; voice codecs; wideband speech codec; wideband speech sources; Distortion measurement; Rate-distortion; Speech; Speech codecs; Speech coding; Wideband; Rate distortion bounds; Speech codec performance; Speech coding;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Information Theory and Applications Workshop (ITA), 2012
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4673-1473-2
Type :
conf
DOI :
10.1109/ITA.2012.6181803
Filename :
6181803