Title :
Efficient algorithms for multichannel extensions of Itakura-Saito nonnegative matrix factorization
Author :
Sawada, Hiroshi ; Kameoka, Hirokazu ; Araki, Shoko ; Ueda, Naonori
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
Abstract :
This paper proposes new algorithms for multichannel extensions of nonnegative matrix factorization (NMF) with the Itakura-Saito (IS) divergence. We employ Hermitian positive definite matrices for modeling the covariance matrix of a multivariate complex Gaussian distribution. Such matrices are basically estimated for NMF bases, but a source separation task can be performed by introducing variables that relate NMF bases and sources. The new algorithms are derived by using a majorization scheme with properly designed auxiliary functions. The algorithms are in the form of multiplicative updates, and exhibit good convergence behavior. We have succeeded in separating a professionally produced music recording into its vocal and guitar components.
Keywords :
Gaussian distribution; covariance matrices; source separation; Hermitian positive definite matrices; Itakura-Saito divergence; Itakura-Saito nonnegative matrix factorization; NMF; auxiliary functions; covariance matrix; guitar components; majorization scheme; multichannel extensions; multivariate complex Gaussian distribution; vocal components; Algorithm design and analysis; Convergence; Covariance matrix; Microphones; Minimization; Source separation; Time frequency analysis; Auxiliary function; Itakura-Saito divergence; Multichannel; Nonnegative matrix factorization; Source separation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6287867