Title :
A speech preprocessing strategy for intelligibility improvement in noise based on a perceptual distortion measure
Author :
Taal, Cees H. ; Hendriks, Richard C. ; Heusdens, Richard
Author_Institution :
Signal & Inf. Process. Lab., Delft Univ. of Technol., Delft, Netherlands
Abstract :
A speech pre-processing algorithm is presented to improve the speech intelligibility in noise for the near-end listener. The algorithm improves the intelligibility by optimally redistributing the speech energy over time and frequency for a perceptual distortion measure, which is based on a spectro-temporal auditory model. In contrast to spectral-only models, short-time information is taken into account. As a consequence, the algorithm is more sensitive to transient regions, which will therefore receive more amplification compared to stationary vowels. It is known from literature that changing the vowel-transient energy ratio is beneficial for improving speech-intelligibility in noise. Objective intelligibility prediction results show that the proposed method has higher speech intelligibility in noise compared to two other reference methods, without modifying the global speech energy.
Keywords :
speech intelligibility; speech processing; global speech energy; near-end listener; objective intelligibility prediction; perceptual distortion measure; spectral-only models; spectro-temporal auditory model; speech intelligibility; speech preprocessing strategy; stationary vowels; transient regions; vowel-transient energy ratio; Cutoff frequency; Distortion measurement; Frequency measurement; Noise; Noise measurement; Speech; Speech processing; Near-end speech enhancement; intelligibility improvement; transients;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288810