DocumentCode :
3297382
Title :
Novel Binaural Spectro-temporal Algorithm for Speech Enhancement in Low SNR Environments
Author :
Sung, Po-Hsun ; Chen, Bo-Wei ; Jang, Ling-Sheng ; Wang, Jhing-Fa
Author_Institution :
Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
fYear :
2012
fDate :
9-13 July 2012
Firstpage :
1021
Lastpage :
1026
Abstract :
A novel BInaural Spectro-Temporal (BIST) algorithm is proposed in this paper to increase the speech intelligibility in low or negative SNR noisy environments. The BIST algorithm consists of two modules. One is the spatial mask for receiving sound from the specific direction, and the other is the spectro-temporal modulation filter for noise reduction. Most speech enhancement algorithms are not applicable in harsh environments because the energy of speech is covered by the noise. To increase the speech intelligibility in low or negative SNR noisy environments, a distinctive approach is proposed to solve this problem. First, the BIST algorithm takes binaural auditory processing as a spatial mask to separate the speech and noise according to their locations. Next, the modulation filter is applied to reduce the noise source in the scale-rate (spectro-temporal modulation) domain according to their different acoustic feature. It works like the spectro-temporal receptive field (STRF) which is the perception response of human auditory cortex. The experimental results demonstrate that the proposed BIST speech enhancement algorithm can improve 20% from the noisy speech at SNR-10dB.
Keywords :
interference suppression; modulation; speech enhancement; speech intelligibility; BIST algorithm; SNR noisy environments; binaural auditory processing; binaural spectro-temporal algorithm; human auditory cortex; low SNR environments; noise reduction; spatial mask; spectro-temporal modulation filter; spectro-temporal receptive field; speech enhancement; speech intelligibility; Acoustics; Built-in self-test; Modulation; Noise; Noise measurement; Speech; Speech enhancement; binaural processing; cochleagram; low SNR; noisy environments; spectro-temporal modulation; speech intelligibility;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2012 IEEE International Conference on
Conference_Location :
Melbourne, VIC
ISSN :
1945-7871
Print_ISBN :
978-1-4673-1659-0
Type :
conf
DOI :
10.1109/ICME.2012.40
Filename :
6298537
Link To Document :
بازگشت