Title :
A New Framework for Underdetermined Speech Extraction Using Mixture of Beamformers
Author :
Dmour, Mohammad A. ; Davies, Mike
Author_Institution :
Inst. for Digital Commun. (IDCOM), Edinburgh Univ., Edinburgh, UK
fDate :
3/1/2011 12:00:00 AM
Abstract :
This paper describes frequency-domain nonlinear mixture of beamformers that can extract a speech source from a known direction when there are fewer microphones than sources (the underdetermined case). Our approach models the data in each frequency bin via Gaussian mixture distributions, which can be learned using the expectation maximization algorithm. The model learning is performed using the observed mixture signals only, and no prior training is required. Nonlinear beamformers are then developed based on this model. The proposed estimators are a nonlinear weighted sum of linear minimum mean square error or minimum variance distortionless response beamformers. The resulting nonlinear beamformers do not need to know or estimate the number of sources, and can be applied to microphone arrays with two or more microphones. We test and evaluate the described methods on underdetermined speech mixtures.
Keywords :
Gaussian distribution; array signal processing; expectation-maximisation algorithm; feature extraction; mean square error methods; microphone arrays; speech processing; Gaussian mixture; expectation maximization algorithm; linear minimum mean square error; microphone arrays; minimum variance distortionless response beamformers; underdetermined speech extraction; Data mining; Frequency; Mean square error methods; Microphone arrays; Noise reduction; Nonlinear distortion; Permission; Signal processing; Speech enhancement; Testing; Beamforming; Gaussian mixture model (GMM); speech extraction; speech separation; underdetermined;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2010.2049514