DocumentCode
1596383
Title
Complex Wavelet Modulation Subbands for Speech Compression
Author
Luneau, Jean-Marc ; Lebrun, Jérôme ; Jensen, Soren Holdt
Author_Institution
Dept. of Electron. Syst., Aalborg Univ., Aalborg
fYear
2009
Firstpage
457
Lastpage
457
Abstract
Low-frequency modulations of sound carry essential information for speech and music and must be preserved under compression. The complex modulation spectrum has already been used for audio compression and is commonly obtained by spectral analysis of only the temporal envelopes of the subbands produced by a time/frequency analysis (a modified discrete cosine transform combined with a modified discrete sine transform). However, the amplitudes and tones of speech or music tend to vary slowly over time, so the temporal envelopes are often smooth and mostly of polynomial type. Processing in this domain usually creates undesirable distortions because only the magnitudes are taken into account and the phase data is often neglected. We remedy this problem by using a complex wavelet transform as a more appropriate tool for envelope and phase processing. Complex wavelets carry both magnitude and phase explicitly, with great sparsity, and preserve polynomials well. Moreover, an analytic Hilbert-like transform is possible with complex wavelets implemented as an orthogonal filter bank.
Keywords
audio signal processing; data compression; discrete cosine transforms; frequency-domain analysis; modulation; music; polynomials; spectral analysis; speech coding; time-domain analysis; wavelet transforms; analytic Hilbert-like transform; audio compression; complex wavelet modulation subbands; frequency analysis; low-frequency modulation; modified discrete cosine transform; modified discrete sine transform; music; orthogonal filter bank; phase processing tool; polynomials; sound; spectral analysis; speech compression; temporal envelopes; time analysis; wavelet transform; Audio compression; Discrete cosine transforms; Discrete transforms; Frequency; Music; Phase distortion; Polynomials; Spectral analysis; Speech; Wavelet transforms; complex wavelets; compression; modulation spectrum; scales; speech; subbands;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Compression Conference, 2009. DCC '09.
Conference_Location
Snowbird, UT
ISSN
1068-0314
Print_ISBN
978-1-4244-3753-5
Type
conf
DOI
10.1109/DCC.2009.52
Filename
4976511