• DocumentCode
    1596383
  • Title
    Complex Wavelet Modulation Subbands for Speech Compression
  • Author
    Luneau, Jean-Marc ; Lebrun, Jérôme ; Jensen, Soren Holdt
  • Author_Institution
    Dept. of Electron. Syst., Aalborg Univ., Aalborg
  • fYear
    2009
  • Firstpage
    457
  • Lastpage
    457
  • Abstract
    Low-frequency modulations of sound carry essential information for speech and music, so compression must preserve them. The complex modulation spectrum has already been used for audio compression and is commonly obtained by spectral analysis of only the temporal envelopes of the subbands produced by a time/frequency analysis (a modified discrete cosine transform combined with a modified discrete sine transform). However, amplitudes and tones of speech or music tend to vary slowly over time, so the temporal envelopes are often smooth and mostly of polynomial type. Processing in this domain usually creates undesirable distortions because only the magnitudes are taken into account and the phase data is often neglected. We remedy this problem by using a complex wavelet transform as a more appropriate envelope and phase processing tool. Complex wavelets carry both magnitude and phase explicitly with great sparsity and preserve polynomials well. Moreover, an analytic Hilbert-like transform is possible with complex wavelets implemented as an orthogonal filter bank.
  • Keywords
    audio signal processing; data compression; discrete cosine transforms; frequency-domain analysis; modulation; music; polynomials; spectral analysis; speech coding; time-domain analysis; wavelet transforms; analytic Hilbert-like transform; audio compression; complex wavelet modulation subbands; frequency analysis; low-frequency modulation; modified discrete cosine transform; modified discrete sine transform; orthogonal filter bank; phase processing tool; sound; speech compression; temporal envelopes; time analysis; wavelet transform; Discrete transforms; Frequency; Phase distortion; Speech; complex wavelets; compression; modulation spectrum; scales; subbands
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Compression Conference, 2009. DCC '09.
  • Conference_Location
    Snowbird, UT
  • ISSN
    1068-0314
  • Print_ISBN
    978-1-4244-3753-5
  • Type
    conf
  • DOI
    10.1109/DCC.2009.52
  • Filename
    4976511