DocumentCode :
1118153
Title :
Improved Subspace-Based Single-Channel Speech Enhancement Using Generalized Super-Gaussian Priors
Author :
Jensen, Jesper ; Heusdens, Richard
Author_Institution :
Dept. of Mediamatics, Delft Univ. of Technol.
Volume :
15
Issue :
3
fYear :
2007
fDate :
3/1/2007 12:00:00 AM
Firstpage :
862
Lastpage :
872
Abstract :
Traditional single-channel subspace-based schemes for speech enhancement rely mostly on linear minimum mean-square error estimators, which are globally optimal only if the Karhunen-Loeacuteve transform (KLT) coefficients of the noise and speech processes are Gaussian distributed. We derive in this paper subspace-based nonlinear estimators assuming that the speech KLT coefficients are distributed according to a generalized super-Gaussian distribution which has as special cases the Laplacian and the two-sided Gamma distribution. As with the traditional linear estimators, the derived estimators are functions of the a priori signal-to-noise ratio (SNR) in the subspaces spanned by the KLT transform vectors. We propose a scheme for estimating these a priori SNRs, which is in fact a generalization of the "decision-directed" approach which is well-known from short-time Fourier transform (STFT)-based enhancement schemes. We show that the proposed a priori SNR estimation scheme leads to a significant reduction of the residual noise level, a conclusion which is confirmed in extensive objective speech quality evaluations as well as subjective tests. We also show that the derived estimators based on the super-Gaussian KLT coefficient distribution lead to improvements for different noise sources and levels as compared to when a Gaussian assumption is imposed
Keywords :
Fourier transforms; Gaussian distribution; Karhunen-Loeve transforms; gamma distribution; least mean squares methods; nonlinear estimation; signal denoising; speech enhancement; Karhunen-Loeve transform coefficients; Laplacian distribution; a priori signal-to-noise ratio; decision-directed approach; generalized super-Gaussian distribution; linear minimum mean-square error estimators; objective speech quality evaluations; residual noise level reduction; short-time Fourier transform-based enhancement schemes; subspace-based nonlinear estimators; subspace-based single-channel speech enhancement; two-sided Gamma distribution; Fourier transforms; Gaussian noise; Karhunen-Loeve transforms; Laplace equations; Noise level; Signal to noise ratio; Speech analysis; Speech enhancement; Speech processing; Vectors; Minimum mean square error (MMSE) estimation; speech enhancement; subspace-based noise reduction;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2005.885939
Filename :
4100685
Link To Document :
بازگشت