Title :
Spectral dynamics is more important than spectral distortion
Author :
Knagenhjelm, H. Petter ; Kleijn, W. Bastiaan
Author_Institution :
Speech Coding Res. Dept., AT&T Bell Labs., Murray Hill, NJ, USA
Abstract :
Linear prediction coefficients are used to describe the power-spectrum envelope in the majority of low-bit-rate coders. The performance of quantizers for the linear prediction coefficients is generally evaluated in terms of spectral distortion. This paper shows that the audible distortion in low-bit-rate coders is often more a function of the dynamics of the power-spectrum envelope than of the spectral distortion as usually evaluated. Smoothing the evolution of the power-spectrum envelope over time increases the reconstructed speech quality. A reasonable objective is to find the smoothest path that keeps the quantized parameters within the Voronoi regions associated with the transmitted quantization index. We demonstrate increased quantizer performance by such smoothing of the line-spectral frequencies.
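The constrained-smoothing objective described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not specify; it assumes a full-search vector quantizer with Euclidean distance, a simple coordinate-descent smoother that pulls each frame toward the average of its neighbours, and a step-halving rule that rejects any move leaving the Voronoi region of the transmitted index. The names `nearest_index`, `smooth_within_voronoi`, and `roughness` are hypothetical.

```python
import numpy as np

def nearest_index(codebook, x):
    """Index of the codeword closest to x (Euclidean distance)."""
    return int(np.argmin(np.sum((codebook - x) ** 2, axis=1)))

def roughness(path):
    """Sum of squared frame-to-frame differences (assumed smoothness measure)."""
    return float(np.sum(np.diff(path, axis=0) ** 2))

def smooth_within_voronoi(codebook, indices, n_iter=50, step=0.5):
    """Smooth a decoded parameter trajectory frame by frame.

    Each frame may move only while its nearest codeword is still the
    transmitted index, i.e. while it stays inside its Voronoi region.
    The first and last frames are left at their codewords.
    """
    path = codebook[indices].astype(float)  # start at the transmitted codewords
    for _ in range(n_iter):
        for t in range(1, len(path) - 1):
            target = 0.5 * (path[t - 1] + path[t + 1])  # neighbour average
            a = step
            while a > 1e-3:
                candidate = path[t] + a * (target - path[t])
                if nearest_index(codebook, candidate) == indices[t]:
                    path[t] = candidate  # move accepted: still in the region
                    break
                a *= 0.5  # shrink the move until it stays inside the region
    return path

if __name__ == "__main__":
    # Synthetic data only: a random 8-entry codebook of 4 ascending values
    # standing in for line-spectral frequencies, and random indices.
    rng = np.random.default_rng(0)
    codebook = np.sort(rng.uniform(0.0, np.pi, size=(8, 4)), axis=1)
    indices = rng.integers(0, 8, size=20)
    smoothed = smooth_within_voronoi(codebook, indices)
    print(roughness(codebook[indices]), roughness(smoothed))
```

Because every accepted move is a convex combination of points whose components are already in ascending order, the smoothed vectors keep the ascending ordering expected of line-spectral frequencies. A decoder following this idea would need to look ahead by at least one frame, which adds delay.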
Keywords :
computational geometry; linear predictive coding; smoothing methods; spectral analysis; speech coding; vocoders; Voronoi regions; audible distortion; line-spectral frequencies; linear prediction coefficients; low-bit-rate coders; performance; power-spectrum envelope; quantizers; reconstructed speech quality; smoothing; spectral distortion; spectral dynamics; transmitted quantization index; Added delay; Auditory system; Bit rate; Distortion measurement; Frequency; Humans; Interpolation; Quantization
Conference_Title :
1995 International Conference on Acoustics, Speech, and Signal Processing (ICASSP-95)
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479798