Title :
Scalable audio separation with light Kernel Additive Modelling
Author :
Liutkus, Antoine ; Fitzgerald, Derry ; Rafii, Zafar
Author_Institution :
Speech Process. Team, Inria, Villers-lès-Nancy, France
Abstract :
Recently, Kernel Additive Modelling (KAM) was proposed as a unified framework to achieve multichannel audio source separation. Its main feature is to use kernel models for locally describing the spectrograms of the sources. Such kernels can capture source features such as repetitivity, stability over time and/or frequency, self-similarity, etc. KAM notably subsumes many popular and effective methods from the state of the art, including REPET and harmonic/percussive separation with median filters. However, it also comes with an important drawback in its initial form: its memory usage badly scales with the number of sources. Indeed, KAM requires the storage of the full-resolution spectrogram for each source, which may become prohibitive for full-length tracks or many sources. In this paper, we show how it can be combined with a fast compression algorithm of its parameters to address the scalability issue, thus enabling its use on small platforms or mobile devices.
Keywords :
audio signal processing; compressed sensing; median filters; source separation; KAM; REPET; compression algorithm; full-resolution spectrogram; harmonic-percussive separation; light kernel additive modelling; median filters; memory usage; mobile devices; multichannel audio source separation; Additives; Approximation methods; Harmonic analysis; Kernel; Random access memory; Source separation; Spectrogram; Kernel Additive Modelling; randomized algorithms; sound source separation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7177935