مرکز منطقه ای اطلاع رساني علوم و فناوري - A Versatile Framework for Speaker Separation Using a Model-Based Speaker Localization Approach

DocumentCode :

1414883

Title :

A Versatile Framework for Speaker Separation Using a Model-Based Speaker Localization Approach

Author :

Madhu, N. ; Martin, R.

Author_Institution :

Dept. of Neurosciences, Katholieke Univ. Leuven, Leuven, Belgium

Volume :

Issue :

fYear :

2011

Firstpage :

1900

Lastpage :

1912

Abstract :

We build upon our speaker localization framework developed in a previous work (N. Madhu and R. Martin, A scalable framework for multiple speaker localization and tracking,” in Proc. Int. Workshop Acoustic Echo Noise Control (IWAENC), Sep. 2008) to perform source separation. The proposed approach, exploiting the supplementary information from the mixture of Gaussians-based localization model, allows for the incorporation of a wide class of separation algorithms, from the nonlinear time-frequency mask-based approaches to a fully adaptive beamformer in the generalized sidelobe canceller (GSC) structure. We propose, in addition, a generalized estimation of the blocking matrix based on subspace projectors. The adaptive beamformer realized as proposed is insensitive to gain mismatches among the sensors, obviating the need for magnitude calibration of the microphones. It is also demonstrated that the proposed linear approach has a performance comparable to that of an optimal (oracle) GSC implementation. In comparison to ICA-based approaches, another advantage of the separation framework described herein is its robustness to ambient noise and scenarios with an unknown number of sources.

Keywords :

Gaussian distribution; array signal processing; estimation theory; microphones; source separation; speaker recognition; Gaussians-based localization; adaptive beamformer; ambient noise; blocking matrix; generalized estimation; generalized sidelobe canceller structure; magnitude calibration; microphones; multiple speaker localization; multiple speaker tracking; nonlinear time-frequency mask; source separation; speaker separation; subspace projectors; Adaptation model; Microphones; Noise; Noise measurement; Sensors; Speech; Time frequency analysis;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2010.2102754

Filename :

5677448

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1414883