DocumentCode :
343518
Title :
Rate-constrained self-organizing neural maps and efficient psychovisual methods for low bit rate video coding
Author :
Ferguson, Keith L. ; Allinson, Nigel M.
Author_Institution :
Dept. of Electr. Eng. & Electron., Univ. of Manchester Inst. of Sci. & Technol., UK
fYear :
1999
fDate :
36373
Firstpage :
390
Lastpage :
399
Abstract :
The video coding problem is essentially an operational distortion-rate issue where the underlying input pixel data, probability distributions and dimensions are discrete, unknown and not smooth. In the low bit rate case the high resolution assumptions for vector quantization are not strictly valid and the problem is exacerbated. However, by considering the rate-constrained operational points on sets of self-organizing neural maps (SOMs), provides a methodology for selecting locally optimal vector quantizers. The learning process of the standard SOM algorithm is modified to minimize the distortion subject to a constraint of entropy approximation. The applied training set is adapted to suit the proposed coding environment. Operating in the discrete wavelet transform (DWT) domain is well suited to the inclusion of a psychovisual model. The spatial frequency response, the multiresolution scene analysis and the central focusing aspects of the visual cortex are incorporated into the model. The resulting video coding algorithm is bit rate scalable from 10 k bits per second (bits/s) and provides subjectively acceptable video at a fixed frame rate or 10 frames per second (f.p.s.) with a QCIF pixel resolution
Keywords :
discrete wavelet transforms; entropy; frequency response; image resolution; probability; rate distortion theory; self-organising feature maps; teleconferencing; transform coding; vector quantisation; video coding; visual perception; QCIF pixel resolution; SOM algorithm; discrete wavelet transform; distortion minimization; efficient psychovisual methods; entropy approximation; frame rate; high resolution; input pixel data; learning process; locally optimal vector quantizers; low bit rate video coding; multiresolution scene analysis; operational distortion-rate; probability distributions; rate-constrained self-organizing neural maps; spatial frequency response; vector quantization; video coding algorithm; videoconferencing; visual cortex; Approximation algorithms; Bit rate; Brain modeling; Discrete wavelet transforms; Entropy; Probability distribution; Psychology; Vector quantization; Video coding; Wavelet domain;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks for Signal Processing IX, 1999. Proceedings of the 1999 IEEE Signal Processing Society Workshop.
Conference_Location :
Madison, WI
Print_ISBN :
0-7803-5673-X
Type :
conf
DOI :
10.1109/NNSP.1999.788158
Filename :
788158
Link To Document :
بازگشت