• DocumentCode
    700153
  • Title

    Under-determined speech separation using GMM-based non-linear beamforming

  • Author

    Dmour, Mohammad A. ; Davies, Michael E.

  • Author_Institution
    Inst. for Digital Commun. & Joint Res. Inst. for Signal & Image Process., Univ. of Edinburgh, Edinburgh, UK
  • fYear
    2008
  • fDate
    25-29 Aug. 2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This paper introduces a frequency-domain non-linear beamformer that can perform speech source separation of under-determined mixtures, is reasonably artifact-free and does not require prior knowledge of the number of speakers. This beamformer utilises a Gaussian mixture distribution to model the observation probability density in each frequency bin, which can be learnt using the expectation maximisation (EM) algorithm. A linear minimum-variance distortionless response (MVDR) beamformer is determined for each of the Gaussian components. The proposed non-linear beamformer is then a weighted sum of these linear MVDR beamformers and is therefore also distortionless. The relative contribution for each linear MVDR beamformer is calculated as the posterior probability (specific to each time-frequency point) of its corresponding Gaussian component. Simulation results of the non-linear beamformer in under-determined mixtures with room reverberation confirm its ability to successfully separate speech sources with virtually no artifacts.
  • Keywords
    Gaussian distribution; Gaussian processes; array signal processing; expectation-maximisation algorithm; mixture models; source separation; speech processing; EM algorithm; GMM-based nonlinear beamforming; Gaussian components; Gaussian mixture distribution; expectation maximisation algorithm; frequency bin; frequency-domain non-linear beamformer; linear MVDR beamformers; linear minimum-variance distortionless response beamformer; observation probability density; posterior probability; room reverberation; time-frequency point; under-determined speech source separation mixtures; Array signal processing; Frequency-domain analysis; Microphones; Noise; Nonlinear distortion; Reverberation; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2008 16th European
  • Conference_Location
    Lausanne
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7080685