• DocumentCode
    17140
  • Title

    Multichannel Source Separation and Tracking With RANSAC and Directional Statistics

  • Author

    Traa, Johannes ; Smaragdis, Paris

  • Author_Institution
    Electr. & Comput. Eng. Dept., Univ. of Illinois at Urbana-Champaign (UIUC), Urbana, IL, USA
  • Volume
    22
  • Issue
    12
  • fYear
    2014
  • fDate
    Dec. 2014
  • Firstpage
    2233
  • Lastpage
    2243
  • Abstract
    We describe multichannel blind source separation and tracking algorithms based on clustering wrapped interchannel phase difference (IPD) features. We pose the clustering problem as one of multimodal circular-linear regression and present its probabilistic formulation. Phase wrapping due to spatial aliasing is explicitly incorporated by modeling the IPD features as circular variables. We present two methods based on Expectation-Maximization (EM) and a sequential variant of RANdom SAmple Consensus (RANSAC). We show that their strengths can be combined by using RANSAC to initialize EM. The IPD clustering algorithm is applied to separate stationary speakers from a multichannel mixture. We then extend it to the case of moving speakers by tracking their directions-of-arrival with the Factorial Wrapped Kalman Filter (FWKF) using RANSAC as a data preprocessor. Experimental results demonstrate that the proposed methods perform well in the presence of reverberant babble noise and spatial aliasing. The FWKF successfully tracks and separates moving speakers with separation quality comparable to that for stationary speakers.
  • Keywords
    Kalman filters; blind source separation; direction-of-arrival estimation; expectation-maximisation algorithm; pattern clustering; probability; random processes; regression analysis; signal sampling; EM; FWKF; IPD feature; RANSAC; clustering wrapped interchannel phase difference feature; data preprocessor; directional statistics; directions-of-arrival tracking algorithm; expectation-maximization; factorial wrapped Kalman filter; multichannel blind source separation algorithm; multichannel mixture; multimodal circular-linear regression; phase wrapping; probabilistic formulation; random sample consensus; reverberant babble noise presence; spatial aliasing; stationary speaker separation; Arrays; Clustering algorithms; Microphones; Source separation; Speech; Speech processing; Vectors; Blind source separation (BSS); directional statistics; interchannel phase difference (IPD); wrapped Kalman filter;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2014.2365701
  • Filename
    6939657