مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

573573

Title :

MMSE speech enhancement using GMM

Author :

Chehresa, S. ; Savoji, M.H.

Author_Institution :

Electr. & Comput. Eng. Fac., Shahid Beheshti Univ., Tehran, Iran

fYear :

2012

fDate :

2-3 May 2012

Firstpage :

266

Lastpage :

271

Abstract :

A new and effective algorithm is proposed in this paper based on Gaussian Mixture Modeling (GMM) and Minimum Mean Square Error (MMSE) criterion for speech enhancement. GMM mean vectors are used to model the space span by the power spectra of the input noisy speech frames. No assumption is made on the nature or stationarity of the noise. No Voice Activity Detection (VAD) or any other means is used to estimate the input Signal to Noise Ratio (SNR). The mean vectors derived from mixture models of Power Spectral Densities (PSDs) of speech and different noise sources are used to form sets of over-determined system of equations, as many as noise source candidates, whose solutions lead to the MMSE estimations of speech and noise power spectra. These are then used for noise suppression by applying Wiener filtering carried out on overlapping frames. The input SNR is estimated and the nature of the noise involved is determined as by-products of the method used. Results are compared with those of two variants of a method based on approximate but explicit MMSE Bayesian estimation that show good results but suffer from long processing times. It is shown that, at the cost of a slight lower improvement in SNR and PESQ score, the new algorithm reduces the computation time to 1/30 which makes it suitable for practical applications.

Keywords :

Bayes methods; Gaussian noise; Wiener filters; least mean squares methods; speech enhancement; vectors; GMM mean vectors; Gaussian mixture modeling; MMSE speech enhancement; PESQ score; PSD; Wiener filtering; computation time reduction; explicit MMSE Bayesian estimation approximation; input SNR estimation; input noisy speech frames; minimum mean square error criterion; noise power spectral densities; noise sources; noise suppression; over-determined equation system; overlapping frames; power spectra; processing times; signal-to-noise ratio; space span modelling; Mathematical model; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Artificial Intelligence and Signal Processing (AISP), 2012 16th CSI International Symposium on

Conference_Location :

Shiraz, Fars

Print_ISBN :

978-1-4673-1478-7

Type :

conf

DOI :

10.1109/AISP.2012.6313756

Filename :

6313756

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=573573