Title :
Estimators of the Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty
Author :
Lu, Yang ; Loizou, Philipos C.
Author_Institution :
Cirrus Logic, Inc., Austin, TX, USA
fDate :
7/1/2011 12:00:00 AM
Abstract :
Statistical estimators of the magnitude-squared spectrum are derived based on the assumption that the magnitude-squared spectrum of the noisy speech signal can be computed as the sum of the (clean) signal and noise magnitude-squared spectra. Maximum a posterior (MAP) and minimum mean square error (MMSE) estimators are derived based on a Gaussian statistical model. The gain function of the MAP estimator was found to be identical to the gain function used in the ideal binary mask (IdBM) that is widely used in computational auditory scene analysis (CASA). As such, it was binary and assumed the value of 1 if the local signal-to-noise ratio (SNR) exceeded 0 dB, and assumed the value of 0 otherwise. By modeling the local instantaneous SNR as an F-distributed random variable, soft masking methods were derived incorporating SNR uncertainty. The soft masking method, in particular, which weighted the noisy magnitude-squared spectrum by the a priori probability that the local SNR exceeds 0 dB was shown to be identical to the Wiener gain function. Results indicated that the proposed estimators yielded significantly better speech quality than the conventional minimum mean square error spectral power estimators, in terms of yielding lower residual noise and lower speech distortion.
Keywords :
maximum likelihood estimation; mean square error methods; signal processing; speech processing; Gaussian statistical model; SNR uncertainty; Wiener gain function; computational auditory scene analysis; magnitude squared spectrum estimation; minimum mean square error estimator; speech distortion; speech signal; Approximation methods; Gain; Noise measurement; Signal to noise ratio; Speech; Speech processing; Binary mask; maximum a posterior (MAP) estimators; minimum mean square error (MMSE) estimators; soft mask; speech enhancement;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2010.2082531