DocumentCode :
294777
Title :
Image coding with mixed representations and visual masking
Author :
Zhu, Bin ; Tewfik, Ahmed H. ; Gerek, Ömer Nezih
Author_Institution :
Dept. of Electr. Eng., Minnesota Univ., Minneapolis, MN, USA
Volume :
4
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
2327
Abstract :
We propose a novel approach for low bit rate perceptually transparent image compression. It exploits both frequency and spatial visual masking effects and uses a combination of Fourier and wavelet transforms to encode different bands. Frequency domain masking is computed by using a fine to coarse analysis step. Spatial domain masking is computed either by using Girod's (1989) model or a coarse to fine analysis step that accurately computes local contrast. A discrete cosine transform is used in conjunction with frequency domain masking to encode the low frequency bands. The medium and high frequency bands are encoded using spatial domain masking and a wavelet transform. The encoding of these bands is based on a recursive selection of the important edges in each band. It uses cross-band prediction to minimize the bit rate. Experiments show that the approach can achieve very high quality to nearly transparent compression at bit rates of 0.2 to 0.4 bits/pixel.
Keywords :
Fourier transforms; data compression; discrete cosine transforms; image coding; image representation; prediction theory; transform coding; visual perception; wavelet transforms; DCT; Fourier transforms; coarse to fine analysis step; cross-band prediction; discrete cosine transform; entropy coding; experiments; fine to coarse analysis step; frequency domain masking; frequency visual masking effects; high frequency bands; image coding; local contrast; low bit rate; low frequency bands; medium frequency bands; mixed representations; perceptually transparent image compression; quantisation; spatial visual masking effects; wavelet transforms; Bit rate; Discrete Fourier transforms; Discrete cosine transforms; Discrete wavelet transforms; Filters; Fourier transforms; Frequency domain analysis; Image coding; Masking threshold; Quantization; Wavelet domain; Wavelet transforms;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479958
Filename :
479958