Title :
Direction of Arrival Based Spatial Covariance Model for Blind Sound Source Separation
Author :
Nikunen, Joona ; Virtanen, Tuomas
Author_Institution :
Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
Abstract :
This paper addresses the problem of sound source separation from a multichannel microphone array capture via estimation of source spatial covariance matrix (SCM) of a short-time Fourier transformed mixture signal. In many conventional audio separation algorithms the source mixing parameter estimation is done separately for each frequency thus making them prone to errors and leading to suboptimal source estimates. In this paper we propose a SCM model which consists of a weighted sum of direction of arrival (DoA) kernels and estimate only the weights dependent on the source directions. In the proposed algorithm, the spatial properties of the sources become jointly optimized over all frequencies, leading to more coherent source estimates and mitigating the effect of spatial aliasing at high frequencies. The proposed SCM model is combined with a linear model for magnitudes and the parameter estimation is formulated in a complex-valued non-negative matrix factorization (CNMF) framework. Simulations consist of recordings done with a hand-held device sized array having multiple microphones embedded inside the device casing. Separation quality of the proposed algorithm is shown to exceed the performance of existing state of the art separation methods with two sources when evaluated by objective separation quality metrics.
Keywords :
Fourier transforms; array signal processing; audio signal processing; blind source separation; covariance matrices; direction-of-arrival estimation; matrix decomposition; microphone arrays; CNMF framework; DoA kernels; SCM model; array signal processing; audio separation algorithms; blind sound source separation; complex-valued nonnegative matrix factorization framework; device casing; direction of arrival based spatial covariance model; handheld device sized array; linear model; multichannel microphone array; objective separation quality metrics; short-time Fourier transformed mixture signal; source mixing parameter estimation; source spatial covariance matrix estimation; spatial aliasing effect mitigation; spatial property; suboptimal source estimates; weighted sum of direction of arrival kernels; Arrays; Direction-of-arrival estimation; Frequency estimation; Kernel; Mathematical model; Microphones; Source separation; Array signal processing; direction of arrival estimation; multichannel source separation; non-negative matrix factorization; spatial covariance models;
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
DOI :
10.1109/TASLP.2014.2303576