Title :
Exploring permutation inconsistency in blind separation of speech signals in a reverberant environment
Author :
Ikram, Muhammad Z. ; Morgan, Dennis R.
Author_Institution :
Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
Abstract :
We study and explore the limitations of methods for blind separation of a mixture of multiple speakers in a real reverberant environment. To support our results, we analyze a frequency-domain method, which achieves blind source separation (BSS) by transforming the time-domain convolutive problem to multiple short-term problems in the frequency domain. We show that treating the problem independently at different frequency bins introduces a “permutation inconsistency” problem, which becomes worse as the length of the room impulse response increases. Our studies prove that the ideas proposed in the existing literature are not capable of effectively handling this problem and that a need exists for its satisfactory solution. We speculate that time-domain BSS techniques may also suffer from an equivalent permutation inconsistency problem when long un-mixing filters are used.
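The permutation inconsistency described in the abstract can be illustrated with a small numerical sketch. This is not the authors' algorithm: the per-bin mixing matrices `H`, the source powers, and the ideal unmixing matrices `W` are all hypothetical, randomly generated stand-ins. The sketch only shows that a per-bin decorrelation criterion cannot tell a correct unmixing matrix from one whose outputs are swapped, so the output ordering may differ from bin to bin.

```python
import numpy as np

rng = np.random.default_rng(0)
n_bins, n_src = 4, 2  # assumed toy sizes, not taken from the paper


def random_complex(shape):
    """Random complex array used as a stand-in for a frequency response."""
    return rng.standard_normal(shape) + 1j * rng.standard_normal(shape)


# Hypothetical mixing matrix H(f) for each frequency bin.
H = random_complex((n_bins, n_src, n_src))

# Ideal unmixing matrix W(f) = H(f)^-1 for each bin.
W = np.linalg.inv(H)

# Uncorrelated sources: diagonal source covariance in every bin.
source_cov = np.stack(
    [np.diag(rng.uniform(0.5, 2.0, n_src)) for _ in range(n_bins)]
).astype(complex)

# Mixture covariance R_x(f) = H(f) * Lambda(f) * H(f)^H.
Rx = H @ source_cov @ H.conj().transpose(0, 2, 1)

# Swap the two outputs in the odd-numbered bins only.
P = np.array([[0, 1], [1, 0]])
W_perm = W.copy()
W_perm[1::2] = P @ W[1::2]


def off_diag_energy(Wf, Rf):
    """Energy of the off-diagonal terms of the output covariance."""
    Ry = Wf @ Rf @ Wf.conj().T
    return np.sum(np.abs(Ry - np.diag(np.diag(Ry))) ** 2)


# The permuted solution still decorrelates the outputs in every bin,
# so a per-bin decorrelation criterion accepts it.
decorrelated = all(
    off_diag_energy(W_perm[f], Rx[f]) < 1e-8 for f in range(n_bins)
)

# Global response G(f) = W_perm(f) H(f): which source dominates output 0?
G = W_perm @ H
dominant = np.argmax(np.abs(G[:, 0, :]), axis=1)

print("decorrelated in every bin:", decorrelated)
print("source on output 0, per bin:", dominant.tolist())
```

In this toy setup, output channel 0 carries one source in the even bins and the other source in the odd bins, even though each bin on its own looks perfectly separated. Transforming such outputs back to the time domain would mix the sources again, which is the failure mode the abstract refers to.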
Keywords :
convolution; decorrelation; frequency-domain analysis; reverberation; speech processing; blind separation; blind source separation; frequency-domain method; long un-mixing filters; mixture of multiple speakers; multiple short-term problems; permutation inconsistency; reverberant environment; room impulse response; speech signals; time-domain convolutive problem; Filters; Frequency domain analysis; Image processing; Microphones; Signal processing; Source separation; Speech enhancement; Speech recognition; Telephony; Time domain analysis;
Conference_Title :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.859141