Title :
Initialization of nonnegative matrix factorization dictionaries for single channel source separation
Author :
Grais, E.M. ; Erdogan, H.
Author_Institution :
Fac. of Eng. & Natural Sci., Sabanci Univ., Istanbul, Turkey
Abstract :
In this work, we study different initialization methods for the nonnegative matrix factorization (NMF) dictionaries or bases. There is a need for good initializations for NMF dictionary because NMF decomposition is a non-convex problem which has many local minima. The effect of the initialization of NMF is evaluated in this work on audio source separation applications. In supervised audio source separation, NMF is used to train a set of basis vectors (basis matrix) for each source in an iterative fashion. Then NMF is used to decompose the mixed signal spectrogram as a weighted linear combination of the trained basis vectors for all sources in the mixed signal. The estimate for each source is computed by summing the decomposition terms that include its corresponding trained bases. In this work, we use principal component analysis (PCA), spherical K-means, and fuzzy C-means (FCM) to initialize the NMF basis matrices during the training procedures. Experimental results show that, better initialization for NMF bases gives better audio separation performance than using NMF with random initialization.
Keywords :
audio signal processing; concave programming; fuzzy set theory; iterative methods; learning (artificial intelligence); principal component analysis; random processes; singular value decomposition; source separation; vectors; FCM; NMF basis matrix; NMF decomposition; NMF dictionary; audio source separation; basis vector training; basis vectors; channel source separation; fuzzy C-means; iterative method; mixed signal spectrogram decomposition; nonconvex problem; nonnegative matrix factorization; principal component analysis; random initialization; spherical K-means; weighted linear combination; Matrix decomposition; Principal component analysis; Source separation; Spectrogram; Speech; Training; Vectors; Nonnegative matrix factorization; data clustering; dictionary learning; fuzzy clustering; principal component analysis; single channel source separation;
Conference_Titel :
Signal Processing and Communications Applications Conference (SIU), 2013 21st
Conference_Location :
Haspolat
Print_ISBN :
978-1-4673-5562-9
Electronic_ISBN :
978-1-4673-5561-2
DOI :
10.1109/SIU.2013.6531172