DocumentCode :
1323873
Title :
Single-Channel Source Separation Using EMD-Subband Variable Regularized Sparse Features
Author :
Gao, Bin ; Woo, W.L. ; Dlay, S.S.
Author_Institution :
Sch. of Electr., Electron. & Comput. Eng., Newcastle Univ., Newcastle upon Tyne, UK
Volume :
19
Issue :
4
fYear :
2011
fDate :
5/1/2011 12:00:00 AM
Firstpage :
961
Lastpage :
976
Abstract :
A novel approach to solve the single-channel source separation (SCSS) problem is presented. Most existing supervised SCSS methods resort exclusively to the independence waveform criteria as exemplified by training the prior information before the separation process. This poses a significant limiting factor to the applicability of these methods to real problem. Our proposed method does not require training knowledge for separating the mixture and it is based on decomposing the mixture into a series of oscillatory components termed as the intrinsic mode functions (IMFs). We show, in this paper, that the IMFs have several desirable properties unique to SCSS problem and how these properties can be advantaged to relax the constraints posed by the problem. In addition, we have derived a novel sparse non-negative matrix factorization to estimate the spectral bases and temporal codes of the sources. The proposed algorithm is a more complete and efficient approach to matrix factorization where a generalized criterion for variable sparseness is imposed onto the solution. Experimental testing has been conducted to show that the proposed method gives superior performance over other existing approaches.
Keywords :
matrix decomposition; source separation; EMD-subband variable regularized-sparse features; IMF; empirical mode decomposition; independence waveform criteria; intrinsic mode functions; oscillatory components; single-channel source separation; sparse nonnegative matrix factorization; spectral base estimation; supervised SCSS methods; temporal code estimation; Algorithm design and analysis; Hidden Markov models; Psychoacoustic models; Source separation; Sparse matrices; Speech; Training; Audio processing; blind source separation (BSS); empirical mode decomposition (EMD); non-negative matrix factorization (NMF); single-channel source separation (SCSS); sparse features;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2010.2072500
Filename :
5570953
Link To Document :
بازگشت