• DocumentCode
    1798506
  • Title

    An intrinsic mode function basis dictionary for auditory signal processing

  • Author

    Chang Gao ; Haifeng Li ; Lin Ma

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
  • fYear
    2014
  • fDate
    7-9 July 2014
  • Firstpage
    16
  • Lastpage
    21
  • Abstract
    As one important field of sparse representation, the research of dictionary learning attracts most researchers interest in signal processing study. Empirical Mode Decomposition (EMD), as an efficient and adaptive signal decomposition method that depends completely on the signal, is considered as an innovative and appropriative the basis function theory. The Intrinsic Mode Functions (IMFs) obtained by EMD are used as the basis of that expansion which can be linear or nonlinear as dictated by the data, and their linear combination is an efficient representation of the original signals. However, IMFs cannot directly engage in the sparse representation of signals, and their application to the auditory signal processing is quite limited. In this paper, we propose a universal algorithm for dictionary learning that transforms raw IMFs into valuable basis functions. The signals are decomposed into IMFs by EMD, then the general dictionary learning algorithm is implemented on these IMFs, finally, the IMF basis dictionary is learned. Experiments of sparse representation and reconstruction of speech signals are carried out to verify the effectiveness and efficiency of the proposed IMF basis dictionary. The results proved that the signal-to-noise ratio between the reconstructed speech signal and the original one is much higher comparing with other traditional dictionaries, and a better sparseness is achieved.
  • Keywords
    learning (artificial intelligence); signal reconstruction; signal representation; speech processing; EMD; adaptive signal decomposition method; auditory signal processing; empirical mode decomposition; general dictionary learning algorithm; intrinsic mode function basis dictionary; sparse representation; speech signal reconstruction; Dictionaries; Encoding; Signal processing algorithms; Signal to noise ratio; Speech; Training; Empirical Mode Decomposition; Intrinsic Mode Functions; auditory signal processing; dictionary learning; sparse representation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Audio, Language and Image Processing (ICALIP), 2014 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4799-3902-2
  • Type

    conf

  • DOI
    10.1109/ICALIP.2014.7009748
  • Filename
    7009748