Title :
A CASA-Based System for Long-Term SNR Estimation
Author :
Narayanan, Arun ; Wang, DeLiang
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Abstract :
We present a system for robust signal-to-noise ratio (SNR) estimation based on computational auditory scene analysis (CASA). The proposed algorithm uses an estimate of the ideal binary mask to segregate a time-frequency representation of the noisy signal into speech dominated and noise dominated regions. Energy within each of these regions is summated to derive the filtered global SNR. An SNR transform is introduced to convert the estimated filtered SNR to the true broadband SNR of the noisy signal. The algorithm is further extended to estimate subband SNRs. Evaluations are done using the TIMIT speech corpus and the NOISEX92 noise database. Results indicate that both global and subband SNR estimates are superior to those of existing methods, especially at low SNR conditions.
Keywords :
filtering theory; signal representation; speech processing; time-frequency analysis; CASA-based system; NOISEX92 noise database; SNR transform; TIMIT speech corpus; computational auditory scene analysis; ideal binary mask estimation; long-term SNR estimation; noise dominated regions; noisy signal; signal-to-noise ratio estimation; speech dominated region; time-frequency representation; Broadband communication; Estimation; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; Computational auditory scene analysis (CASA); broadband SNR; ideal binary mask (IBM); signal-to-noise ratio (SNR); subband SNR;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2012.2205242