Realizing speech enhancement by combining EEMD and K-SVD dictionary training algorithm

Author

Hao Chen ; Zhenye Gan ; Hongwu Yang

Author_Institution

Coll. of Phys. & Electron. Eng., Northwest Normal Univ., Lanzhou, China

fYear

2014

fDate

12-14 Sept. 2014

Firstpage

378

Lastpage

378

Abstract

Summary form only given. This paper presents a speech enhancement algorithm that combines the ensemble empirical mode decomposition (EEMD) and the K-singular value decomposition (K-SVD) dictionary-training algorithm together to obtain clean speech from noisy speech. The EEMD algorithm is firstly employed to obtain intrinsic mode function (IMF) components from noisy speech. The cross-correlations and autocorrelations of each IMF are calculated from the IMF components to filter out the noisy IMF components. Meanwhile, the transition IMF components are again decomposed with EEMD to further remove the noisy component. The remained original IMFs alone with the remained transition IMFs are then superimposed to generate the new noisy speech. The new noisy speech is then sparse de-composed by the K-SVD dictionary-training algorithm with an over-complete dictionary trained from clean speech. Enhanced speech is obtained by recovering the speech signal from sparse coefficient vectors. Different from the traditional speech enhancement algorithms, the algorithm enhances the noisy speech by the sparse representation of noisy speech that has been pre-de-noised with EEMD algorithm previously. Experimental results show that the algorithm achieves significant de-noising results than the traditional spectral subtraction, wavelet threshold de-noising algorithm and K-SVD dictionary-training algorithm under both low SNR situation and high SNR situation.

Keywords

signal denoising; singular value decomposition; speech enhancement; wavelet transforms; EEMD algorithm; K-SVD dictionary training algorithm; K-SVD dictionary-training algorithm; K-singular value decomposition dictionary-training algorithm; SNR situation; autocorrelation; clean speech; cross-correlation; enhanced speech; ensemble empirical mode decomposition; intrinsic mode function component; noisy IMF components; noisy speech; sparse coefficient vector; sparse representation; spectral subtraction; speech enhancement algorithm; speech signal; wavelet threshold denoising algorithm; Gallium nitride; Correlation; EEMD; K-SVD; Speech Enhancement;

fLanguage

English

Publisher

ieee

Conference_Titel

Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on

Conference_Location

Singapore

Type

conf

DOI

10.1109/ISCSLP.2014.6936575

Filename

6936575