مرکز منطقه ای اطلاع رساني علوم و فناوري - Integrated Speech Enhancement Method Using Noise Suppression and Dereverberation

DocumentCode :

1066756

Title :

Integrated Speech Enhancement Method Using Noise Suppression and Dereverberation

Author :

Yoshioka, Takuya ; Nakatani, Tomohiro ; Miyoshi, Masato

Author_Institution :

NTT Commun. Sci. Labs., Nippon Telegraph & Telephone Corp., Kyoto

Volume :

Issue :

fYear :

2009

Firstpage :

231

Lastpage :

246

Abstract :

This paper proposes a method for enhancing speech signals contaminated by room reverberation and additive stationary noise. The following conditions are assumed. 1) Short-time spectral components of speech and noise are statistically independent Gaussian random variables. 2) A room´s convolutive system is modeled as an autoregressive system in each frequency band. 3) A short-time power spectral density of speech is modeled as an all-pole spectrum, while that of noise is assumed to be time-invariant and known in advance. Under these conditions, the proposed method estimates the parameters of the convolutive system and those of the all-pole speech model based on the maximum likelihood estimation method. The estimated parameters are then used to calculate the minimum mean square error estimates of the speech spectral components. The proposed method has two significant features. 1) The parameter estimation part performs noise suppression and dereverberation alternately. (2) Noise-free reverberant speech spectrum estimates, which are transferred by the noise suppression process to the dereverberation process, are represented in the form of a probability distribution. This paper reports the experimental results of 1500 trials conducted using 500 different utterances. The reverberation time RT₆₀ was 0.6 s, and the reverberant signal to noise ratio was 20, 15, or 10 dB. The experimental results show the superiority of the proposed method over the sequential performance of the noise suppression and dereverberation processes.

Keywords :

Gaussian processes; interference suppression; least mean squares methods; maximum likelihood estimation; reverberation; speech enhancement; Gaussian random variables; additive stationary noise; maximum likelihood estimation; minimum mean square error estimation; noise suppression; parameter estimation; power spectral density; room deverberation; speech enhancement; speech spectrum estimation; Additive noise; Frequency; Gaussian noise; Maximum likelihood estimation; Mean square error methods; Parameter estimation; Power system modeling; Random variables; Reverberation; Speech enhancement; Dereverberation; maximum-likelihood (ML) estimation; minimum mean square error (MMSE) estimation; noise suppression; speech enhancement;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2008.2008042

Filename :

4749471

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1066756