DocumentCode
9722
Title
Multichannel Extensions of Non-Negative Matrix Factorization With Complex-Valued Data
Author
Sawada, Hideyuki ; Kameoka, Hirokazu ; Araki, Shunsuke ; Ueda, Naonori
Author_Institution
NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
Volume
21
Issue
5
fYear
2013
fDate
May-13
Firstpage
971
Lastpage
982
Abstract
This paper presents new formulations and algorithms for multichannel extensions of non-negative matrix factorization (NMF). The formulations employ Hermitian positive semidefinite matrices to represent a multichannel version of non-negative elements. Multichannel Euclidean distance and multichannel Itakura-Saito (IS) divergence are defined based on appropriate statistical models utilizing multivariate complex Gaussian distributions. To minimize this distance/divergence, efficient optimization algorithms in the form of multiplicative updates are derived by using properly designed auxiliary functions. Two methods are proposed for clustering NMF bases according to the estimated spatial property. Convolutive blind source separation (BSS) is performed by the multichannel extensions of NMF with the clustering mechanism. Experimental results show that 1) the derived multiplicative update rules exhibited good convergence behavior, and 2) BSS tasks for several music sources with two microphones and three instrumental parts were evaluated successfully.
Keywords
Gaussian distribution; Hermitian matrices; blind source separation; convergence of numerical methods; mathematical programming; matrix decomposition; microphones; statistical analysis; Hermitian positive semidefinite matrices; NMF; NMF bases clustering; appropriate statistical models; auxiliary functions; complex-valued data; convergence behavior; convolutive BSS; convolutive blind source separation; microphones; multichannel Euclidean distance; multichannel IS divergence; multichannel Itakura-Saito divergence; multichannel extensions; multichannel version; multiplicative updates; multivariate complex Gaussian distributions; nonnegative elements; nonnegative matrix factorization; optimization algorithms; three instrumental parts; Clustering algorithms; Conferences; Convergence; Euclidean distance; Matrix decomposition; Source separation; Time frequency analysis; Blind source separation; clustering; convolutive mixture; multichannel; non-negative matrix factorization;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2013.2239990
Filename
6410389
Link To Document