DocumentCode
3517755
Title
A multistage approach for blind separation of convolutive speech mixtures
Author
Jan, Tariqullah ; Wang, Wenwu ; Wang, DeLiang
Author_Institution
Centre for Vision, Speech & Signal Process., Univ. of Surrey, Guildford
fYear
2009
fDate
19-24 April 2009
Firstpage
1713
Lastpage
1716
Abstract
In this paper, we propose a novel algorithm for the separation of convolutive speech mixtures using two-microphone recordings, based on the combination of independent component analysis (ICA) and ideal binary mask (IBM), together with a post-filtering process in the cepstral domain. Essentially, the proposed algorithm consists of three steps. First, a constrained convolutive ICA algorithm is applied to separate the source signals from two-microphone recordings. In the second step, we estimate the IBM by comparing the energy of corresponding time-frequency (T-F) units from the separated sources obtained with the convolutive ICA algorithm. The last step is to reduce musical noise caused typically by T-F masking using cepstral smoothing. The performance of the proposed approach is evaluated based on both reverberant mixtures generated using a simulated room model and real recordings. The proposed algorithm offers considerably higher efficiency, together with improved speech quality while producing similar separation performance as compared with a recent approach.
Keywords
blind source separation; independent component analysis; smoothing methods; speech processing; T-F masking; blind separation; cepstral smoothing; convolutive speech mixtures; ideal binary mask; independent component analysis; multistage approach; musical noise; post-filtering process; two-microphone recordings; Acoustic noise; Cepstral analysis; Independent component analysis; Interference; Mathematical model; Signal processing algorithms; Smoothing methods; Speech analysis; Speech coding; Speech processing; Independent component analysis (ICA); cepstral smoothing; estimated binary mask; ideal binary mask (IBM); musical noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4959933
Filename
4959933
Link To Document