Title :
A Data-Driven Approach to A Priori SNR Estimation
Author :
Suhadi, Suhadi ; Last, Carsten ; Fingscheidt, Tim
Author_Institution :
Inst. for Commun. Technol., Tech. Univ. Braunschweig, Braunschweig, Germany
Abstract :
The a priori signal-to-noise ratio (SNR) plays an important role in many speech enhancement algorithms. In this paper, we present a data-driven approach to a priori SNR estimation. It may be used with a wide range of speech enhancement techniques, such as, e.g., the minimum mean square error (MMSE) (log) spectral amplitude estimator, the super Gaussian joint maximum a posteriori (JMAP) estimator, or the Wiener filter. The proposed SNR estimator employs two trained artificial neural networks, one for speech presence, one for speech absence. The classical decision-directed a priori SNR estimator by Ephraim and Malah is broken down into its two additive components, which now represent the two input signals to the neural networks. Both output nodes are combined to represent the new a priori SNR estimate. As an alternative to the neural networks, also simple lookup tables are investigated. Employment of these data-driven nonlinear a priori SNR estimators reduces speech distortion, particularly in speech onset, while retaining a high level of noise attenuation in speech absence.
Keywords :
Gaussian processes; Wiener filters; interference suppression; least mean squares methods; maximum likelihood estimation; neural nets; signal denoising; spectral analysis; speech enhancement; table lookup; Wiener filter; artificial neural networks; classical decision-directed a priori SNR estimator; data-driven nonlinear a priori SNR estimation; lookup tables; minimum mean square error spectral amplitude estimator; neural networks; noise attenuation; speech distortion reduction; speech enhancement algorithms; super Gaussian joint maximum a posteriori estimator; Amplitude estimation; Artificial neural networks; Employment; Mean square error methods; Neural networks; Nonlinear distortion; Signal to noise ratio; Speech enhancement; Table lookup; Wiener filter; Data-driven approach; decision-directed (DD) a priori signal-to-noise ratio (SNR) estimation; noise reduction; speech enhancement;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2010.2045799