DocumentCode
3379389
Title
A loudness enhancement technique for speech
Author
Boillot, Marc A. ; Harris, John G.
Author_Institution
Computational Neuro-Eng. Lab., Florida Univ., Gainesville, FL, USA
Volume
5
fYear
2004
fDate
23-26 May 2004
Abstract
We propose an application of the vocoder postfilter to increase perceived loudness of clean speech without increasing signal energy or degrading intelligibility. The critical band concept in auditory theory states that perceived loudness of a narrow-band signal will increase when the bandwidth of that signal increases beyond a critical band, even though the energy remains constant. Our post-filter technique applies formant bandwidth expansion to the vowel regions of speech without changing the vowel power to elevate perceived loudness. Vowels are known to contain the highest energy, have a smooth spectral envelope, long temporal sustenance, and for this reason are suitable candidates to target for a loudness enhancement technique. ISO-532B loudness analysis patterns and listening tests are provided to demonstrate a perceptual loudness improvement corresponding to a 2dB power gain.
Keywords
filtering theory; loudness; speech enhancement; vocoders; 2 dB; ISO-532B loudness analysis patterns; auditory theory; critical band concept; formant bandwidth expansion; listening tests; loudness enhancement; narrow-band signal; perceived loudness elevation; perceptual loudness improvement; signal energy; spectral envelope; speech intelligibility; speech vowel regions; temporal sustenance; vocoder postfilter; vowel power; Acoustic noise; Adaptive filters; Bandwidth; Degradation; Neural engineering; Pollution measurement; Signal to noise ratio; Speech enhancement; Steady-state; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems, 2004. ISCAS '04. Proceedings of the 2004 International Symposium on
Print_ISBN
0-7803-8251-X
Type
conf
DOI
10.1109/ISCAS.2004.1329883
Filename
1329883
Link To Document