An intrinsic mode function basis dictionary for auditory signal processing

Author

Chang Gao ; Haifeng Li ; Lin Ma

Author_Institution

Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China

fYear

2014

fDate

7-9 July 2014

Firstpage

16

Lastpage

21

Abstract

As one important field of sparse representation, the research of dictionary learning attracts most researchers interest in signal processing study. Empirical Mode Decomposition (EMD), as an efficient and adaptive signal decomposition method that depends completely on the signal, is considered as an innovative and appropriative the basis function theory. The Intrinsic Mode Functions (IMFs) obtained by EMD are used as the basis of that expansion which can be linear or nonlinear as dictated by the data, and their linear combination is an efficient representation of the original signals. However, IMFs cannot directly engage in the sparse representation of signals, and their application to the auditory signal processing is quite limited. In this paper, we propose a universal algorithm for dictionary learning that transforms raw IMFs into valuable basis functions. The signals are decomposed into IMFs by EMD, then the general dictionary learning algorithm is implemented on these IMFs, finally, the IMF basis dictionary is learned. Experiments of sparse representation and reconstruction of speech signals are carried out to verify the effectiveness and efficiency of the proposed IMF basis dictionary. The results proved that the signal-to-noise ratio between the reconstructed speech signal and the original one is much higher comparing with other traditional dictionaries, and a better sparseness is achieved.

Keywords

learning (artificial intelligence); signal reconstruction; signal representation; speech processing; EMD; adaptive signal decomposition method; auditory signal processing; empirical mode decomposition; general dictionary learning algorithm; intrinsic mode function basis dictionary; sparse representation; speech signal reconstruction; Dictionaries; Encoding; Signal processing algorithms; Signal to noise ratio; Speech; Training; Empirical Mode Decomposition; Intrinsic Mode Functions; auditory signal processing; dictionary learning; sparse representation;

fLanguage

English

Publisher

ieee

Conference_Titel

Audio, Language and Image Processing (ICALIP), 2014 International Conference on

Conference_Location

Shanghai

Print_ISBN

978-1-4799-3902-2

Type

conf

DOI

10.1109/ICALIP.2014.7009748

Filename

7009748