DocumentCode :
179832
Title :
Non-negative source-filter dynamical system for speech enhancement
Author :
Simsekli, U. ; Le Roux, Jonathan ; Hershey, John R.
Author_Institution :
Dept. of Comput. Eng., Bogazici Univ., Istanbul, Turkey
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
6206
Lastpage :
6210
Abstract :
Model-based speech enhancement methods, which rely on separately modeling the speech and the noise, have been shown to be powerful in many different problem settings. When the structure of the noise can be arbitrary, which is often the case in practice, modelbased methods have to focus on developing good speech models, whose quality will be key to their performance. In this study, we propose a novel probabilistic model for speech enhancement which precisely models the speech by taking into account the underlying speech production process as well as its dynamics. The proposed model follows a source-filter approach where the excitation and filter parts are modeled as non-negative dynamical systems. We present convergence-guaranteed update rules for each latent factor. In order to assess performance, we evaluate our model on a challenging speech enhancement task where the speech is observed under non-stationary noises recorded in a car. We show that our model outperforms state-of-the-art methods in terms of objective measures.
Keywords :
filtering theory; probability; speech enhancement; nonnegative source-filter dynamical system; probabilistic model; speech enhancement; speech models; speech production process; Computational modeling; Hidden Markov models; Signal to noise ratio; Speech; Speech enhancement; non-negative dynamical system; non-negative matrix factorization; source separation; source-filter model; speech enhancement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854797
Filename :
6854797
Link To Document :
بازگشت