Title :
Performance analysis of ideal binary masks in speech enhancement
Author :
Jiang, Yi ; Zhou, Hong ; Feng, Zhenming
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Abstract :
Binary masks are essential elements to be used in monaural speech segregation and hearing aids. The performances of the ideal binary masks in terms of signal to noise ratio were evaluated in this article, and a method to predict it before application was proposed. Within the framework of the computational auditory scene analysis (CASA), Ideal binary mask (IBM) has the optimum performance in time-frequency (T-F) units. It can be used as an object goal in global level, which was confirmed by the experiments on a speech mixture database. Furthermore, energy distribution of the target and interfere signals were used together to estimate the performance of IBM in mixture separation.
Keywords :
hearing aids; speech enhancement; computational auditory scene analysis; hearing aids; ideal binary mask; mixture separation; monaural speech segregation; performance analysis; signal to noise ratio; speech enhancement; speech mixture database; time-frequency units; Amplitude modulation; Image analysis; Signal to noise ratio; Speech; Speech recognition; Time frequency analysis; Ideal binary masks (IBM); computionanl auditory scene analysis (CASA); monaural speech segregation; time-frequency(T-F) units;
Conference_Titel :
Image and Signal Processing (CISP), 2011 4th International Congress on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-9304-3
DOI :
10.1109/CISP.2011.6100732