Author_Institution :
Grad. Inst. of Networking & Multimedia, Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
Summary form only given. Distributed video coding (DVC) has been intensively studied in recent years. This new coding paradigm substantially differs from conventional prediction-based video codecs such as MPEG and H.26x, which are characterized by a complex encoder and simple decoder. The conventional DVC codec, e.g., DISCOVER codec, uses advanced frame interpolation techniques to create SI based on adjacent decoded reference frames. The quality of SI is a well-recognized factor in the RD performance of WZ video coding. A high SI quality implies a high correlation between the created SI and the original WZ frame, which then decreases the rate required to achieve a given decoded quality. Clearly, the performance of an SI creation process based on adjacent previously decoded frames is limited by the quality of the past and the future reference frames as well as the distance and motion behavior between them. The correlation between high-motion frames is low and vice versa. That is, SI quality in the conventional codecs depends on the temporal correlation of key frames, which affects the bitrate and PSNR of the compression process. In this work, a novel DVC architecture for dealing with the cases of high-motion and large GOP-size sequences is proposed to better the rate-distortion (RD) performance. For high-motion video sequences, the proposed architecture generates SI by using subsampled spatial information instead of interpolated temporal information. the proposed approach separates the video sequence into subsampled key frames and corresponding WZ frames, which changes the creation of SI. That is, all successive frames on the encoder side are downsized to sub-frames, which are then compressed by an H.264/AVC intra encoder. Experimental results reveal that the subsampling input based DVC codec can gain up to 1.47 dB in the RD measures and maintains the most important characteristic of the DVC codec, the encoder is lightweight, as compared with the conventional WZ cod- c, respectively. The novel DVC architecture evaluated in this study exploits spatial relations to create SI. The experimental results confirm that the RD performance of the proposed approach is superior to that of the conventional one for high-motion and/or large GOP-size sequences. The quality of spatial interpolation based SI is higher than that of the temporal interpolation one, which leads to a high-PSNR reconstructed WZ frame. The subsampled key frames are also decoded by LDPCA decoder to recover the information lost when H.264/AVC intra coding is used to increase PSNR gain. Since many spatial domain interpolation and super resolution schemes have been proposed for use in the fields of image processing and computer vision, the performance of the proposed DVC codec can be further enhanced by using better schemes to generate even better SI.
Keywords :
computer vision; data compression; image reconstruction; image resolution; image sampling; image sequences; interpolation; parity check codes; video codecs; video coding; GOP-size sequences; H.264-AVC intraencoder; H.26x; LDPCA decoder; MPEG; RD performance; SI creation process; WZ video coding; Wyner-Ziv video coding; adjacent decoded reference frames; advanced frame interpolation techniques; compression process; computer vision; distributed video coding; high-PSNR reconstructed WZ frame; high-motion frames; high-motion video sequences; image processing; interpolated temporal information; key frame temporal correlation; prediction-based video codecs; rate-distortion performance; spatial domain interpolation; subsampled spatial information; subsampling input based DVC codec; subsampling input based side information creation; super resolution schemes; Codecs; Computer architecture; Correlation; Image coding; Interpolation; Silicon; Video coding; DISCOVER; Distributed Video Coding; Wyner-Ziv coding; subsampling input based;