• DocumentCode
    9722
  • Title

    Multichannel Extensions of Non-Negative Matrix Factorization With Complex-Valued Data

  • Author

    Sawada, Hideyuki ; Kameoka, Hirokazu ; Araki, Shunsuke ; Ueda, Naonori

  • Author_Institution
    NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
  • Volume
    21
  • Issue
    5
  • fYear
    2013
  • fDate
    May-13
  • Firstpage
    971
  • Lastpage
    982
  • Abstract
    This paper presents new formulations and algorithms for multichannel extensions of non-negative matrix factorization (NMF). The formulations employ Hermitian positive semidefinite matrices to represent a multichannel version of non-negative elements. Multichannel Euclidean distance and multichannel Itakura-Saito (IS) divergence are defined based on appropriate statistical models utilizing multivariate complex Gaussian distributions. To minimize this distance/divergence, efficient optimization algorithms in the form of multiplicative updates are derived by using properly designed auxiliary functions. Two methods are proposed for clustering NMF bases according to the estimated spatial property. Convolutive blind source separation (BSS) is performed by the multichannel extensions of NMF with the clustering mechanism. Experimental results show that 1) the derived multiplicative update rules exhibited good convergence behavior, and 2) BSS tasks for several music sources with two microphones and three instrumental parts were evaluated successfully.
  • Keywords
    Gaussian distribution; Hermitian matrices; blind source separation; convergence of numerical methods; mathematical programming; matrix decomposition; microphones; statistical analysis; Hermitian positive semidefinite matrices; NMF; NMF bases clustering; appropriate statistical models; auxiliary functions; complex-valued data; convergence behavior; convolutive BSS; convolutive blind source separation; microphones; multichannel Euclidean distance; multichannel IS divergence; multichannel Itakura-Saito divergence; multichannel extensions; multichannel version; multiplicative updates; multivariate complex Gaussian distributions; nonnegative elements; nonnegative matrix factorization; optimization algorithms; three instrumental parts; Clustering algorithms; Conferences; Convergence; Euclidean distance; Matrix decomposition; Source separation; Time frequency analysis; Blind source separation; clustering; convolutive mixture; multichannel; non-negative matrix factorization;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2239990
  • Filename
    6410389