Title :
A loudness enhancement technique for speech
Author :
Boillot, Marc A. ; Harris, John G.
Author_Institution :
Computational Neuro-Eng. Lab., Florida Univ., Gainesville, FL, USA
Abstract :
We propose an application of the vocoder postfilter to increase perceived loudness of clean speech without increasing signal energy or degrading intelligibility. The critical band concept in auditory theory states that perceived loudness of a narrow-band signal will increase when the bandwidth of that signal increases beyond a critical band, even though the energy remains constant. Our post-filter technique applies formant bandwidth expansion to the vowel regions of speech without changing the vowel power to elevate perceived loudness. Vowels are known to contain the highest energy, have a smooth spectral envelope, long temporal sustenance, and for this reason are suitable candidates to target for a loudness enhancement technique. ISO-532B loudness analysis patterns and listening tests are provided to demonstrate a perceptual loudness improvement corresponding to a 2dB power gain.
Keywords :
filtering theory; loudness; speech enhancement; vocoders; 2 dB; ISO-532B loudness analysis patterns; auditory theory; critical band concept; formant bandwidth expansion; listening tests; loudness enhancement; narrow-band signal; perceived loudness elevation; perceptual loudness improvement; signal energy; spectral envelope; speech intelligibility; speech vowel regions; temporal sustenance; vocoder postfilter; vowel power; Acoustic noise; Adaptive filters; Bandwidth; Degradation; Neural engineering; Pollution measurement; Signal to noise ratio; Speech enhancement; Steady-state; Testing;
Conference_Titel :
Circuits and Systems, 2004. ISCAS '04. Proceedings of the 2004 International Symposium on
Print_ISBN :
0-7803-8251-X
DOI :
10.1109/ISCAS.2004.1329883