A Joint Probabilistic-Deterministic Approach using Source-Filter Modeling of Speech Signal for Single Channel Speech Separation

Author

Radfar, M.H. ; Dansereau, R.M. ; Sayadiyan, A.

Author_Institution

Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, ON

fYear

2006

fDate

6-8 Sept. 2006

Firstpage

47

Lastpage

52

Abstract

In this paper, we present a new technique for separating two speech signals from a single recording. For this purpose, we decompose the speech signal into the excitation signal and the vocal tract function and then estimate the components from the mixed speech using a hybrid model. We first express the probability density function (PDF) of the mixed speech´s log spectral vectors in terms of the PDFs of the underlying speech signal´s vocal tract functions. Then, the mean vectors of PDFs of the vocal tract functions are obtained using a Maximum Likelihood estimator given the mixed signal. Finally, the estimated vocal tract function along with the extracted pitch values are used to reconstruct estimates of the individual speech signals. We compare our model with both an underdetermined blind source separation and a CASA method. The experimental results show our model outperforms both techniques in terms of SNR improvement and the percentage of crosstalk suppression.

Keywords

filtering theory; maximum likelihood estimation; probability; signal reconstruction; source separation; spectral analysis; speech processing; excitation signal; joint probabilistic-deterministic approach; maximum likelihood estimation; mixed speech log spectral vector; probability density function; single channel speech separation; speech signal reconstruction; speech signal source-filter modeling; speech single recording; vocal tract function estimation; Blind source separation; Crosstalk; Fourier transforms; Frequency domain analysis; Maximum likelihood estimation; Probability density function; Psychoacoustic models; Source separation; Speech; Systems engineering and theory;

fLanguage

English

Publisher

ieee

Conference_Titel

Machine Learning for Signal Processing, 2006. Proceedings of the 2006 16th IEEE Signal Processing Society Workshop on

Conference_Location

Arlington, VA

ISSN

1551-2541

Print_ISBN

1-4244-0656-0

Electronic_ISBN

1551-2541

Type

conf

DOI

10.1109/MLSP.2006.275520

Filename

4053619