DocumentCode :
2613128
Title :
Performance analysis of ideal binary masks in speech enhancement
Author :
Jiang, Yi ; Zhou, Hong ; Feng, Zhenming
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Volume :
5
fYear :
2011
fDate :
15-17 Oct. 2011
Firstpage :
2422
Lastpage :
2425
Abstract :
Binary masks are essential elements to be used in monaural speech segregation and hearing aids. The performances of the ideal binary masks in terms of signal to noise ratio were evaluated in this article, and a method to predict it before application was proposed. Within the framework of the computational auditory scene analysis (CASA), Ideal binary mask (IBM) has the optimum performance in time-frequency (T-F) units. It can be used as an object goal in global level, which was confirmed by the experiments on a speech mixture database. Furthermore, energy distribution of the target and interfere signals were used together to estimate the performance of IBM in mixture separation.
Keywords :
hearing aids; speech enhancement; computational auditory scene analysis; hearing aids; ideal binary mask; mixture separation; monaural speech segregation; performance analysis; signal to noise ratio; speech enhancement; speech mixture database; time-frequency units; Amplitude modulation; Image analysis; Signal to noise ratio; Speech; Speech recognition; Time frequency analysis; Ideal binary masks (IBM); computionanl auditory scene analysis (CASA); monaural speech segregation; time-frequency(T-F) units;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image and Signal Processing (CISP), 2011 4th International Congress on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-9304-3
Type :
conf
DOI :
10.1109/CISP.2011.6100732
Filename :
6100732
Link To Document :
بازگشت