Title :
Multichannel Extensions of Non-Negative Matrix Factorization With Complex-Valued Data
Author :
Sawada, Hideyuki ; Kameoka, Hirokazu ; Araki, Shunsuke ; Ueda, Naonori
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
Abstract :
This paper presents new formulations and algorithms for multichannel extensions of non-negative matrix factorization (NMF). The formulations employ Hermitian positive semidefinite matrices to represent a multichannel version of non-negative elements. Multichannel Euclidean distance and multichannel Itakura-Saito (IS) divergence are defined based on appropriate statistical models utilizing multivariate complex Gaussian distributions. To minimize this distance/divergence, efficient optimization algorithms in the form of multiplicative updates are derived by using properly designed auxiliary functions. Two methods are proposed for clustering NMF bases according to the estimated spatial property. Convolutive blind source separation (BSS) is performed by the multichannel extensions of NMF with the clustering mechanism. Experimental results show that 1) the derived multiplicative update rules exhibited good convergence behavior, and 2) BSS tasks for several music sources with two microphones and three instrumental parts were evaluated successfully.
Keywords :
Gaussian distribution; Hermitian matrices; blind source separation; convergence of numerical methods; mathematical programming; matrix decomposition; microphones; statistical analysis; Hermitian positive semidefinite matrices; NMF; NMF bases clustering; appropriate statistical models; auxiliary functions; complex-valued data; convergence behavior; convolutive BSS; convolutive blind source separation; microphones; multichannel Euclidean distance; multichannel IS divergence; multichannel Itakura-Saito divergence; multichannel extensions; multichannel version; multiplicative updates; multivariate complex Gaussian distributions; nonnegative elements; nonnegative matrix factorization; optimization algorithms; three instrumental parts; Clustering algorithms; Conferences; Convergence; Euclidean distance; Matrix decomposition; Source separation; Time frequency analysis; Blind source separation; clustering; convolutive mixture; multichannel; non-negative matrix factorization;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2013.2239990