Title :
Vocal separation using extended robust principal component analysis with Schatten p/lp-norm and scale compression
Author :
Il-Young Jeong ; Kyogu Lee
Author_Institution :
Grad. Sch. of Convergence Sci. & Technol., Seoul Nat. Univ., Seoul, South Korea
Abstract :
Separating vocal and accompaniment signals from a monaural music signal is a challenging task. Recently, robust principal component analysis (RPCA) has been proposed for use in the magnitude spectrogram domain to separate the low-rank and sparse residual matrices, which are assumed to represent the accompaniment and vocal signals, respectively. In this paper, we propose two extended methods based on the RPCA algorithm for more effective vocal separation. First, we extend the conventional RPCA and propose to use in the spectrogram decomposition framework Schatten p- and lp-norms, which are generalized versions of the nuclear norm and l1-norm used in RPCA, respectively. Second, we apply proper scale compression to the magnitude spectrogram, making it a more appropriate representation for the decomposition. Experiments using the MIR-1K dataset show that the proposed methods yield significantly better separation performance than the conventional RPCA.
Keywords :
audio signal processing; matrix algebra; music; principal component analysis; RPCA algorithm; Schatten P/Lp-norm; low-rank residual matrix; magnitude spectrogram; monaural music signal; nuclear norm; robust principal component analysis; scale compression; sparse residual matrix; spectrogram decomposition; vocal separation; Harmonic analysis; Linear programming; Multiple signal classification; Power harmonic filters; Robustness; Sparse matrices; Spectrogram; Schatten p-norm; Vocal separation; lp-norm; robust principal component analysis; scale compression;
Conference_Titel :
Machine Learning for Signal Processing (MLSP), 2014 IEEE International Workshop on
Conference_Location :
Reims
DOI :
10.1109/MLSP.2014.6958909