DocumentCode
17140
Title
Multichannel Source Separation and Tracking With RANSAC and Directional Statistics
Author
Traa, Johannes ; Smaragdis, Paris
Author_Institution
Electr. & Comput. Eng. Dept., Univ. of Illinois at Urbana-Champaign (UIUC), Urbana, IL, USA
Volume
22
Issue
12
fYear
2014
fDate
Dec. 2014
Firstpage
2233
Lastpage
2243
Abstract
We describe multichannel blind source separation and tracking algorithms based on clustering wrapped interchannel phase difference (IPD) features. We pose the clustering problem as one of multimodal circular-linear regression and present its probabilistic formulation. Phase wrapping due to spatial aliasing is explicitly incorporated by modeling the IPD features as circular variables. We present two methods based on Expectation-Maximization (EM) and a sequential variant of RANdom SAmple Consensus (RANSAC). We show that their strengths can be combined by using RANSAC to initialize EM. The IPD clustering algorithm is applied to separate stationary speakers from a multichannel mixture. We then extend it to the case of moving speakers by tracking their directions-of-arrival with the Factorial Wrapped Kalman Filter (FWKF) using RANSAC as a data preprocessor. Experimental results demonstrate that the proposed methods perform well in the presence of reverberant babble noise and spatial aliasing. The FWKF successfully tracks and separates moving speakers with separation quality comparable to that for stationary speakers.
Keywords
Kalman filters; blind source separation; direction-of-arrival estimation; expectation-maximisation algorithm; pattern clustering; probability; random processes; regression analysis; signal sampling; EM; FWKF; IPD feature; RANSAC; clustering wrapped interchannel phase difference feature; data preprocessor; directional statistics; directions-of-arrival tracking algorithm; expectation-maximization; factorial wrapped Kalman filter; multichannel blind source separation algorithm; multichannel mixture; multimodal circular-linear regression; phase wrapping; probabilistic formulation; random sample consensus; reverberant babble noise presence; spatial aliasing; stationary speaker separation; Arrays; Clustering algorithms; Microphones; Source separation; Speech; Speech processing; Vectors; Blind source separation (BSS); directional statistics; interchannel phase difference (IPD); wrapped Kalman filter;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher
ieee
ISSN
2329-9290
Type
jour
DOI
10.1109/TASLP.2014.2365701
Filename
6939657
Link To Document