Title :
Blind speech separation exploiting temporal and spectral correlations using 2D-HMMs
Author :
Dang Hai Tran Vu ; Haeb-Umbach, Reinhold
Author_Institution :
Dept. of Commun. Eng., Univ. of Paderborn, Paderborn, Germany
Abstract :
We present a novel method to exploit correlations of adjacent time-frequency (TF)-slots for a sparseness-based blind speech separation (BSS) system. Usually, these correlations are exploited by some heuristic smoothing techniques in the post-processing of the estimated soft TF masks. We propose a different approach: Based on our previous work with one-dimensional (1D)-hidden Markov models (HMMs) along the time axis we extend the modeling to two-dimensional (2D)-HMMs to exploit both temporal and spectral correlations in the speech signal. Based on the principles of turbo decoding we solved the complex inference of 2D-HMMs by a modified forward-backward algorithm which operates alternatingly along the time and the frequency axis. Extrinsic information is exchanged between these steps such that increasingly better soft time-frequency masks are obtained, leading to improved speech separation performance in highly reverberant recording conditions.
Keywords :
blind source separation; correlation methods; hidden Markov models; smoothing methods; speech coding; turbo codes; 1D-hidden Markov models; 2D-HMM; BSS system; adjacent time-frequency slots; complex inference; forward-backward algorithm; frequency axis; heuristic smoothing techniques; reverberant recording conditions; soft TF mask estimation; soft time-frequency masks; sparseness-based blind speech separation system; spectral correlations; speech signal; temporal correlations; time axis; turbo decoding principles; Correlation; Hidden Markov models; Signal to noise ratio; Speech; Time-frequency analysis; Vectors;
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2013 Proceedings of the 21st European
Conference_Location :
Marrakech