Title :
Source separation with scattering Non-Negative Matrix Factorization
Author :
Bruna, Joan ; Sprechmann, Pablo ; Yann LeCun
Author_Institution :
Courant Inst. of Math. Sci., New York Univ., New York, NY, USA
Abstract :
This paper presents a single-channel source separation method that extends the ideas of Nonnegative Matrix Factorization (NMF). We interpret the approach of audio demixing via NMF as a cascade of a pooled analysis operator, given for example by the magnitude spectrogram, and a synthesis operators given by the matrix decomposition. Instead of imposing the temporal consistency of the decomposition through sophisticated structured penalties in the synthesis stage, we propose to change the analysis operator for a deep scattering representation, where signals are represented at several time resolutions. This new signal representation is invariant to smooth changes in the signal, consistent with its temporal dynamics. We evaluate the proposed approach in a speech separation task obtaining promising results.
Keywords :
matrix decomposition; signal representation; signal resolution; source separation; NMF; audio demixing; deep scattering representation; magnitude spectrogram; matrix decomposition; nonnegative matrix factorization; pooled analysis operator; signal representation; signal resolution; single-channel source separation method; speech separation task; temporal dynamics; Dictionaries; Scattering; Source separation; Speech; Time-frequency analysis; Wavelet transforms; non-negative matrix factorization; scattering; source separation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178296