On the Information Theoretic Performance Comparison of Causal Video Coding and Predictive Video Coding

Author

En-Hui Yang ; Lin Zheng ; Da-ke He

Author_Institution

Dept. of Electr. & Comput. Eng., Univ. of Waterloo, Waterloo, ON, Canada

Volume

Issue

fYear

2014

fDate

March 2014

Firstpage

1428

Lastpage

1446

Abstract

Causal video coding is a coding paradigm in which video source frames X_1, X_2, ..., X_N are encoded frame by frame: the encoder for each frame can use all previous source frames and all previous encoded frames, while the corresponding decoder can use only the previous encoded frames. In the special case where the encoder for each frame X_k is further restricted to enlist help only from the previous encoded frames, causal video coding reduces to predictive video coding, on which all MPEG-series and H-series video coding standards proposed so far are based. In this paper, we compare the rate distortion performance of causal video coding with that of predictive video coding from an information theoretic perspective by modeling each frame X_k itself as a source X_k = {X_k(i)}_{i=1}^∞. Let R_c*(D_1, ..., D_N) (respectively, R_p*(D_1, ..., D_N)) denote the minimum total rate required to achieve a given distortion level D_1, ..., D_N in causal video coding (respectively, predictive video coding). We first show that, like R_c*(D_1, ..., D_N), for jointly stationary and totally ergodic sources X_1, X_2, ..., X_N, R_p*(D_1, ..., D_N) is equal to the infimum of the nth order total rate distortion function R_{p,n}(D_1, ..., D_N) over all n, where R_{p,n}(D_1, ..., D_N) itself is given by the minimum of an information quantity over a set of auxiliary random variables. We then prove that if the jointly stationary and totally ergodic sources X_1, ..., X_N form a (first-order) Markov chain, then R_p*(D_1, ..., D_N) = R_c*(D_1, ..., D_N). However, this is not true in general if X_1, ..., X_N do not form a (first-order) Markov chain. Specifically, we demonstrate that for an independent and identically distributed vector source (X_1, ..., X_N), if X_1, ..., X_N do not form a (first-order) Markov chain, then under some conditions on source frames and distortion, R_c*(D_1, ..., D_N) is strictly less than R_p*(D_1, ..., D_N) in general. Our techniques allow us to compare R_p*(D_1, ..., D_N) with R_c*(D_1, ..., D_N) even when the single-letter characterization of R_p*(D_1, ..., D_N), if any, is unknown.

Keywords

Markov processes; code standards; iterative decoding; linear predictive coding; rate distortion theory; video coding; H-series video coding standard; MPEG-series video coding standard; Markov chain; auxiliary random variables; causal video coding; distributed vector source; information quantity; information theory; iterative algorithm; predictive video coding; stationary ergodic source; video decoder; video source frame encoding; Encoding; Random variables; Rate-distortion; Standards; Vectors; more and less coding theorem

fLanguage

English

Journal_Title

Information Theory, IEEE Transactions on

Publisher

IEEE

ISSN

0018-9448

Type

jour

DOI

10.1109/TIT.2013.2296523

Filename

6697880

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=42856