An Estimation-Theoretic Framework for Spatially Scalable Video Coding

Author

Jingning Han ; Melkote, Vinay ; Rose, Kenneth

Author_Institution

Dept. of Electr. & Comput. Eng., Univ. of California at Santa Barbara, Santa Barbara, CA, USA

Volume

23

Issue

8

fYear

2014

fDate

Aug. 2014

Firstpage

3684

Lastpage

3697

Abstract

This paper focuses on prediction optimality in spatially scalable video coding. It draws inspiration from an estimation-theoretic prediction framework for quality (SNR) scalability earlier developed by our group, which achieved optimality by fully accounting for relevant information from the current base layer (e.g., quantization intervals) and the enhancement layer, to efficiently calculate the conditional expectation that forms the optimal predictor. It was central to that approach that all layers reconstruct approximations to the same original transform coefficient. In spatial scalability, however, the layers encode different resolution versions of the signal. To approach optimality in enhancement layer prediction, this paper departs from existing spatially scalable codecs that employ pixel domain resampling to perform interlayer prediction. Instead, it incorporates a transform domain resampling technique that ensures that the base layer quantization intervals are accessible and usable at the enhancement layer despite their differing signal resolutions, which in conjunction with prior enhancement layer information, enable optimal prediction. A delayed prediction approach that complements this framework for spatial scalable video coding is then provided to further exploit future base layer frames for additional enhancement layer coding performance gains. Finally, a low-complexity variant of the proposed estimation-theoretic prediction approach is also devised, which approximates the conditional expectation by switching between three predictors depending on a simple condition involving information from both layers, and which retains significant performance gains. Simulations provide experimental evidence that the proposed approaches substantially outperform the standard scalable video codec and other leading competitors.

Keywords

estimation theory; image enhancement; image resolution; image sampling; prediction theory; quantisation (signal); transforms; video codecs; video coding; additional enhancement layer coding performance gains; base layer quantization intervals accessibility; current base layer; delayed prediction approach; enhancement layer prediction; estimation-theoretic prediction framework; layer approximations reconstruction; layer frames; low-complexity variant; optimal prediction; quality SNR scalability; scalable video codec; signal resolutions; spatially scalable codecs; spatially scalable video coding; transform coefficient; transform domain resampling technique; Image coding; Image reconstruction; Quantization (signal); Spatial resolution; Static VAr compensators; Transforms; Video coding; Spatial scalability; delayed prediction; estimation-theoretic prediction; scalable video coding; transform domain resampling;

fLanguage

English

Journal_Title

Image Processing, IEEE Transactions on

Publisher

ieee

ISSN

1057-7149

Type

jour

DOI

10.1109/TIP.2014.2331761

Filename

6839025