DocumentCode :
3517755
Title :
A multistage approach for blind separation of convolutive speech mixtures
Author :
Jan, Tariqullah ; Wang, Wenwu ; Wang, DeLiang
Author_Institution :
Centre for Vision, Speech & Signal Process., Univ. of Surrey, Guildford
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
1713
Lastpage :
1716
Abstract :
In this paper, we propose a novel algorithm for separating convolutive speech mixtures from two-microphone recordings. It combines independent component analysis (ICA) with the ideal binary mask (IBM), followed by a post-filtering process in the cepstral domain. The proposed algorithm consists of three steps. First, a constrained convolutive ICA algorithm is applied to separate the source signals from the two-microphone recordings. Second, the IBM is estimated by comparing the energy of corresponding time-frequency (T-F) units of the separated sources obtained with the convolutive ICA algorithm. Third, cepstral smoothing is applied to reduce the musical noise typically caused by T-F masking. The performance of the proposed approach is evaluated on both reverberant mixtures generated with a simulated room model and real recordings. Compared with a recent approach, the proposed algorithm offers considerably higher efficiency and improved speech quality while achieving similar separation performance.
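The second step described in the abstract (estimating a binary mask by comparing the energy of corresponding T-F units) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `estimate_binary_masks` and `apply_mask`, the threshold parameter `tau`, and the toy input arrays are assumptions made for illustration, and the ICA stage and the cepstral smoothing stage are not shown.

```python
import numpy as np


def estimate_binary_masks(Y1, Y2, tau=1.0):
    """Estimate one binary mask per source from two T-F representations.

    Y1, Y2: complex arrays of identical shape (frequency x time), assumed
            to be the T-F transforms of the two ICA outputs.
    tau:    hypothetical threshold (assumption); a unit is assigned to a
            source only if its energy exceeds tau times the other energy.
    """
    e1 = np.abs(Y1) ** 2  # energy of each T-F unit, source estimate 1
    e2 = np.abs(Y2) ** 2  # energy of each T-F unit, source estimate 2
    m1 = (e1 > tau * e2).astype(float)
    m2 = (e2 > tau * e1).astype(float)
    return m1, m2


def apply_mask(mask, Y):
    """Keep only the T-F units selected by the mask."""
    return mask * Y


# Toy 2x2 "spectrograms" standing in for real ICA outputs (assumption).
Y1 = np.array([[2.0 + 0.0j, 0.1 + 0.0j],
               [0.5 + 0.0j, 3.0 + 0.0j]])
Y2 = np.array([[1.0 + 0.0j, 1.0 + 0.0j],
               [0.0 + 0.5j, 1.0 + 0.0j]])

m1, m2 = estimate_binary_masks(Y1, Y2)
S1 = apply_mask(m1, Y1)
S2 = apply_mask(m2, Y2)
```

With `tau=1.0`, a unit whose two energies are exactly equal is assigned to neither source; whether that matches the paper's rule is not stated in the abstract.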
Keywords :
blind source separation; independent component analysis; smoothing methods; speech processing; T-F masking; blind separation; cepstral smoothing; convolutive speech mixtures; ideal binary mask; independent component analysis; multistage approach; musical noise; post-filtering process; two-microphone recordings; Acoustic noise; Cepstral analysis; Independent component analysis; Interference; Mathematical model; Signal processing algorithms; Smoothing methods; Speech analysis; Speech coding; Speech processing; Independent component analysis (ICA); cepstral smoothing; estimated binary mask; ideal binary mask (IBM); musical noise;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4959933
Filename :
4959933