Title :
Subband based blind source separation for convolutive mixtures of speech
Author :
Araki, Shoko ; Makino, Shoji ; Aichner, Robert ; Nishikawa, Tsuyoki ; Saruwatari, Hiroshi
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
Abstract :
Subband processing is applied to blind source separation (BSS) for convolutive mixtures of speech. This is motivated by the drawback of frequency-domain BSS, i.e., when a long frame with a fixed frame-shift is used to cover reverberation, the number of samples in each frequency decreases and the separation performance is degraded. In our proposed subband BSS, (1) by using a moderate number of subbands, a sufficient number of samples can be held in each subband, mid (2) by using FIR filters in each subband, we can handle long reverberation. Subband BSS achieves better performance than frequency-domain BSS. Moreover, we propose efficient separation procedures that take into consideration the frequency characteristics of room reverberation and speech signals. We achieve this (3) by using longer unmixing filters in low frequency bands, and (4) by adopting overlap-blockshift in BSS´s batch adaptation in low frequency bands. Consequently, frequency-dependent subband processing is successfully realized in the proposed subband BSS.
Keywords :
FIR filters; blind source separation; convolution; reverberation; signal sampling; speech processing; speech recognition; BSS; FIR filters; batch adaptation; blind source separation; convolutive speech mixtures; frequency characteristics; frequency-dependent subband processing; long reverberation; overlap-blockshift; performance; room reverberation; samples; speech signals; unmixing filters; Blind source separation; Degradation; Finite impulse response filter; Frequency estimation; Information science; Laboratories; Microphones; Reverberation; Source separation; Speech processing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1200018