DocumentCode :
700153
Title :
Under-determined speech separation using GMM-based non-linear beamforming
Author :
Dmour, Mohammad A. ; Davies, Michael E.
Author_Institution :
Inst. for Digital Commun. & Joint Res. Inst. for Signal & Image Process., Univ. of Edinburgh, Edinburgh, UK
fYear :
2008
fDate :
25-29 Aug. 2008
Firstpage :
1
Lastpage :
5
Abstract :
This paper introduces a frequency-domain non-linear beamformer that can perform speech source separation of under-determined mixtures, is reasonably artifact-free and does not require prior knowledge of the number of speakers. This beamformer utilises a Gaussian mixture distribution to model the observation probability density in each frequency bin, which can be learnt using the expectation maximisation (EM) algorithm. A linear minimum-variance distortionless response (MVDR) beamformer is determined for each of the Gaussian components. The proposed non-linear beamformer is then a weighted sum of these linear MVDR beamformers and is therefore also distortionless. The relative contribution for each linear MVDR beamformer is calculated as the posterior probability (specific to each time-frequency point) of its corresponding Gaussian component. Simulation results of the non-linear beamformer in under-determined mixtures with room reverberation confirm its ability to successfully separate speech sources with virtually no artifacts.
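The combination step described in the abstract can be illustrated for a single frequency bin. What follows is a minimal sketch, not the authors' implementation. It assumes two microphones, zero-mean circular complex Gaussian components, component covariances and priors already learnt by EM (the EM step is not shown), and a known steering vector d for the target. All function names, variable names and numeric values are hypothetical.

```python
import cmath
import math

def inv2(R):
    """Return the inverse and determinant of a 2x2 matrix given as nested lists."""
    (a, b), (c, d) = R
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]], det

def matvec(M, v):
    """Multiply a 2x2 matrix by a length-2 vector."""
    return [M[0][0] * v[0] + M[0][1] * v[1], M[1][0] * v[0] + M[1][1] * v[1]]

def inner(u, v):
    """Hermitian inner product u^H v."""
    return u[0].conjugate() * v[0] + u[1].conjugate() * v[1]

def mvdr_weights(R, d):
    """MVDR weights w = R^{-1} d / (d^H R^{-1} d), so that w^H d = 1."""
    Rinv, _ = inv2(R)
    Rinv_d = matvec(Rinv, d)
    denom = inner(d, Rinv_d)
    return [w / denom for w in Rinv_d]

def posteriors(x, priors, covs):
    """Posterior probability of each Gaussian component given observation x."""
    log_terms = []
    for p, R in zip(priors, covs):
        Rinv, det = inv2(R)
        quad = inner(x, matvec(Rinv, x)).real      # x^H R^{-1} x
        # Assumed density: exp(-quad) / (pi^2 det R); pi^2 cancels on normalising.
        log_terms.append(math.log(p) - quad - math.log(det.real))
    m = max(log_terms)                             # subtract max for stability
    unnorm = [math.exp(t - m) for t in log_terms]
    total = sum(unnorm)
    return [u / total for u in unnorm]

def nonlinear_beamform(x, d, priors, covs):
    """Posterior-weighted sum of the per-component MVDR outputs at one T-F point."""
    gammas = posteriors(x, priors, covs)
    y = 0j
    for g, R in zip(gammas, covs):
        y += g * inner(mvdr_weights(R, d), x)
    return y, gammas

# Hypothetical example values for one frequency bin.
d = [1 + 0j, cmath.exp(-0.5j)]                     # assumed target steering vector
priors = [0.6, 0.4]
covs = [
    [[2.0, 0.5], [0.5, 1.0]],                      # Hermitian, positive definite
    [[1.0, 0.3j], [-0.3j, 3.0]],
]
s = 0.7 - 0.2j                                     # target-only observation x = s * d
x = [s * d[0], s * d[1]]
y, gammas = nonlinear_beamform(x, d, priors, covs)
# y matches s up to floating-point rounding, whatever the posteriors are.
```

Because every component beamformer satisfies w^H d = 1 and the posteriors sum to one, the weighted sum passes a signal arriving along d unchanged, which is the sense in which the abstract calls the non-linear beamformer distortionless.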
Keywords :
Gaussian distribution; Gaussian processes; array signal processing; expectation-maximisation algorithm; mixture models; source separation; speech processing; EM algorithm; GMM-based nonlinear beamforming; Gaussian components; Gaussian mixture distribution; expectation maximisation algorithm; frequency bin; frequency-domain non-linear beamformer; linear MVDR beamformers; linear minimum-variance distortionless response beamformer; observation probability density; posterior probability; room reverberation; time-frequency point; under-determined speech source separation mixtures; Array signal processing; Frequency-domain analysis; Microphones; Noise; Nonlinear distortion; Reverberation; Speech;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne
ISSN :
2219-5491
Type :
conf
Filename :
7080685