DocumentCode :
573573
Title :
MMSE speech enhancement using GMM
Author :
Chehresa, S. ; Savoji, M.H.
Author_Institution :
Electr. & Comput. Eng. Fac., Shahid Beheshti Univ., Tehran, Iran
fYear :
2012
fDate :
2-3 May 2012
Firstpage :
266
Lastpage :
271
Abstract :
A new and effective algorithm is proposed in this paper based on Gaussian Mixture Modeling (GMM) and Minimum Mean Square Error (MMSE) criterion for speech enhancement. GMM mean vectors are used to model the space span by the power spectra of the input noisy speech frames. No assumption is made on the nature or stationarity of the noise. No Voice Activity Detection (VAD) or any other means is used to estimate the input Signal to Noise Ratio (SNR). The mean vectors derived from mixture models of Power Spectral Densities (PSDs) of speech and different noise sources are used to form sets of over-determined system of equations, as many as noise source candidates, whose solutions lead to the MMSE estimations of speech and noise power spectra. These are then used for noise suppression by applying Wiener filtering carried out on overlapping frames. The input SNR is estimated and the nature of the noise involved is determined as by-products of the method used. Results are compared with those of two variants of a method based on approximate but explicit MMSE Bayesian estimation that show good results but suffer from long processing times. It is shown that, at the cost of a slight lower improvement in SNR and PESQ score, the new algorithm reduces the computation time to 1/30 which makes it suitable for practical applications.
Keywords :
Bayes methods; Gaussian noise; Wiener filters; least mean squares methods; speech enhancement; vectors; GMM mean vectors; Gaussian mixture modeling; MMSE speech enhancement; PESQ score; PSD; Wiener filtering; computation time reduction; explicit MMSE Bayesian estimation approximation; input SNR estimation; input noisy speech frames; minimum mean square error criterion; noise power spectral densities; noise sources; noise suppression; over-determined equation system; overlapping frames; power spectra; processing times; signal-to-noise ratio; space span modelling; Mathematical model; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Artificial Intelligence and Signal Processing (AISP), 2012 16th CSI International Symposium on
Conference_Location :
Shiraz, Fars
Print_ISBN :
978-1-4673-1478-7
Type :
conf
DOI :
10.1109/AISP.2012.6313756
Filename :
6313756
Link To Document :
بازگشت