• DocumentCode
    1187231
  • Title

    Permutation inconsistency in blind speech separation: investigation and solutions

  • Author

    Ikram, Muhammad Z. ; Morgan, Dennis R.

  • Author_Institution
    DSP Solutions R&D Center, Texas Instrum. Inc., Dallas, TX, USA
  • Volume
    13
  • Issue
    1
  • fYear
    2005
  • Firstpage
    1
  • Lastpage
    13
  • Abstract
    Acoustic reverberation severely limits the performance of multiple microphone blind speech separation (BSS) methods. We show that the limited performance is due to random permutations of the unmixing filters over frequency. This problem, which we refer to as permutation inconsistency, becomes worse as the length of the room impulse response increases. We explore interesting connections between BSS and ideal beamforming, which leads us to propose a permutation alignment scheme based on microphone array directivity patterns. Given that the permutations are properly aligned, we show that the blind speech separation method outperforms the nonblind beamformer in a highly reverberant environment. Furthermore, we discover the tradeoff where permutations can be aligned by affording a loss in spectral resolution of the unmixing filters. We then propose a multistage algorithm, which aligns the unmixing filter permutations without sacrificing the spectral resolution. For our study, we perform experiments in both real and simulated environments and compare the results to the ideal performance benchmarks that we derive using prior knowledge of the mixing filters.
  • Keywords
    blind source separation; filters; reverberation; speech processing; transient response; acoustic reverberation; ideal beamforming; microphone array directivity pattern; multiple microphone blind speech separation; multistage algorithm; permutation alignment scheme; permutation inconsistency; random permutation; room impulse response; spectral resolution; unmixing filter; unmixing filter permutation; Acoustics; Array signal processing; Filters; Frequency; Helium; Microphone arrays; Reverberation; Signal resolution; Source separation; Speech enhancement;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2004.834441
  • Filename
    1369307