DocumentCode
1187231
Title
Permutation inconsistency in blind speech separation: investigation and solutions
Author
Ikram, Muhammad Z. ; Morgan, Dennis R.
Author_Institution
DSP Solutions R&D Center, Texas Instrum. Inc., Dallas, TX, USA
Volume
13
Issue
1
fYear
2005
Firstpage
1
Lastpage
13
Abstract
Acoustic reverberation severely limits the performance of multiple microphone blind speech separation (BSS) methods. We show that the limited performance is due to random permutations of the unmixing filters over frequency. This problem, which we refer to as permutation inconsistency, becomes worse as the length of the room impulse response increases. We explore interesting connections between BSS and ideal beamforming, which leads us to propose a permutation alignment scheme based on microphone array directivity patterns. Given that the permutations are properly aligned, we show that the blind speech separation method outperforms the nonblind beamformer in a highly reverberant environment. Furthermore, we discover the tradeoff where permutations can be aligned by affording a loss in spectral resolution of the unmixing filters. We then propose a multistage algorithm, which aligns the unmixing filter permutations without sacrificing the spectral resolution. For our study, we perform experiments in both real and simulated environments and compare the results to the ideal performance benchmarks that we derive using prior knowledge of the mixing filters.
Keywords
blind source separation; filters; reverberation; speech processing; transient response; acoustic reverberation; ideal beamforming; microphone array directivity pattern; multiple microphone blind speech separation; multistage algorithm; permutation alignment scheme; permutation inconsistency; random permutation; room impulse response; spectral resolution; unmixing filter; unmixing filter permutation; Acoustics; Array signal processing; Filters; Frequency; Helium; Microphone arrays; Reverberation; Signal resolution; Source separation; Speech enhancement;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/TSA.2004.834441
Filename
1369307
Link To Document