Title :
Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation
Author :
Ozerov, Alexey ; Févotte, Cédric ; Blouet, Raphaël ; Durrieu, Jean-Louis
Author_Institution :
Rennes Bretagne Atlantique, INRIA, Rennes, France
Abstract :
Separating multiple tracks from professionally produced music recordings (PPMRs) is still a challenging problem. We address this task with a user-guided approach in which the separation system is provided segmental information indicating the time activations of the particular instruments to separate. This information may typically be retrieved from manual annotation. We use a so-called multichannel nonnegative tensor factorization (NTF) model, in which the original sources are observed through a multichannel convolutive mixture and in which the source power spectrograms are jointly modeled by a 3-valence (time/frequency/source) tensor. Our user-guided separation method produced competitive results at the 2010 Signal Separation Evaluation Campaign, with sufficient quality for real-world music editing applications.
Keywords :
audio signal processing; convolution; music; source separation; tensors; 3-valence tensor; PPMR; multichannel NTF model; multichannel convolutive mixture; multichannel nonnegative tensor factorization; multiple track separation; professionally produced music recording; real-world music editing application; segmental information; source power spectrograms; structured constraints; user-guided audio source separation; user-guided separation method; Lead; Noise; Source separation; Spectrogram; Speech; Tensile stress; Time frequency analysis; Audio source separation; generalized expectation maximization; nonnegative tensor factorization; user-guided;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5946389