مرکز منطقه ای اطلاع رساني علوم و فناوري - Single-Channel Speech Separation Using Soft Mask Filtering

DocumentCode :

1224425

Title :

Single-Channel Speech Separation Using Soft Mask Filtering

Author :

Radfar, Mohammad H. ; Dansereau, Richard M.

Author_Institution :

Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, ON

Volume :

Issue :

fYear :

2007

Firstpage :

2299

Lastpage :

2310

Abstract :

We present an approach for separating two speech signals when only one single recording of their linear mixture is available. For this purpose, we derive a filter, which we call the soft mask filter, using minimum mean square error (MMSE) estimation of the log spectral vectors of sources given the mixture´s log spectral vectors. The soft mask filter´s parameters are estimated using the mean and variance of the underlying sources which are modeled using the Gaussian composite source modeling (CSM) approach. It is also shown that the binary mask filter which has been empirically and extensively used in single-channel speech separation techniques is, in fact, a simplified form of the soft mask filter. The soft mask filtering technique is compared with the binary mask and Wiener filtering approaches when the input consists of male+male, female+female, and male+female mixtures. The experimental results in terms of signal-to-noise ratio (SNR) and segmental SNR show that soft mask filtering outperforms binary mask and Wiener filtering.

Keywords :

Gaussian processes; estimation theory; filtering theory; least mean squares methods; source separation; speech processing; Gaussian composite source modeling; Wiener filtering; binary mask filtering; log spectral vector; minimum mean square error estimation; parameter estimation; single-channel speech signal separation; soft mask filtering; Estimation error; Filtering; Independent component analysis; Mean square error methods; Source separation; Speech coding; Speech enhancement; Speech processing; Vectors; Wiener filter; Mask filtering; minimum mean square error (MMSE) estimation; single-channel speech separation; source separation; speech enhancement;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2007.904233

Filename :

4317574

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1224425