DocumentCode :
2399563
Title :
Adaptive wavelet subband coding for music compression
Author :
Ferens, K. ; Kinsner, W.
Author_Institution :
Dept. of Electr. & Comput. Eng., Manitoba Univ., Winnipeg, Man., Canada
fYear :
1995
fDate :
28-30 Mar 1995
Firstpage :
460
Abstract :
This paper describes modelling of the coefficient domain in wavelet subbands of wideband audio signals for low-bit rate and high-quality compression. The purpose is to develop models of the perception of wideband audio signals in the wavelet domain. The coefficients in the wavelet subbands are quantized using a scheme that adapts to the subband signal by setting the quantization step size for a particular subband to a size that is inversely proportional to the subband energy, and then, within a subband, by modifying the energy determined step size as inversely proportional to the amplitude probability density of the coefficient. The amplitude probability density of the coefficients in each subband is modelled using learned vector/scalar quantization employing frequency sensitive competitive learning. The source data consists of 1-channel, 16-bit linear data sampled at 44.1 kHz from a CD containing major classical and pop music. Preliminary results show a bit-rate of 150 kbps, rather than 705.6 kbps, with no perceptual loss in quality. The wavelet transform provides better results for representing multifractal signals, such as wide band audio, than do other standard transforms, such as the Fourier transform
Keywords :
adaptive signal processing; audio coding; audio discs; audio signals; music; signal representation; unsupervised learning; vector quantisation; wavelet transforms; 150 kbit/s; 16 bit; 44.1 kHz; CD; adaptive wavelet subband coding; amplitude probability density; classical music; frequency sensitive competitive learning; high quality compression; learned vector/scalar quantization; linear data; low bit rate; multifractal signals; music compression; pop music; quantization step size; signal representation; source data; subband energy; wavelet domain; wavelet subbands coefficients; wavelet transform; wideband audio signals; Data compression; Fourier transforms; Fractals; Frequency; Multiple signal classification; Psychoacoustic models; Signal resolution; Wavelet domain; Wavelet transforms; Wideband;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Compression Conference, 1995. DCC '95. Proceedings
Conference_Location :
Snowbird, UT
ISSN :
1068-0314
Print_ISBN :
0-8186-7012-6
Type :
conf
DOI :
10.1109/DCC.1995.515570
Filename :
515570
Link To Document :
بازگشت