DocumentCode
83012
Title
Perceptual Video Coding Based on SSIM-Inspired Divisive Normalization
Author
Shiqi Wang ; Rehman, Akif ; Zhou Wang ; Siwei Ma ; Wen Gao
Author_Institution
Inst. of Digital Media, Peking Univ., Beijing, China
Volume
22
Issue
4
fYear
2013
fDate
Apr-13
Firstpage
1418
Lastpage
1429
Abstract
We propose a perceptual video coding framework based on the divisive normalization scheme, which is found to be an effective approach to model the perceptual sensitivity of biological vision, but has not been fully exploited in the context of video coding. At the macroblock (MB) level, we derive the normalization factors based on the structural similarity (SSIM) index as an attempt to transform the discrete cosine transform domain frame residuals to a perceptually uniform space. We further develop an MB level perceptual mode selection scheme and a frame level global quantization matrix optimization method. Extensive simulations and subjective tests verify that, compared with the H.264/AVC video coding standard, the proposed method can achieve significant gain in terms of rate-SSIM performance and provide better visual quality.
Keywords
discrete cosine transforms; matrix algebra; optimisation; video coding; H.264-AVC video coding standard; MB level perceptual mode selection scheme; SSIM index; SSIM-inspired divisive normalization scheme; biological vision perceptual sensitivity; discrete cosine transform domain frame; frame level global quantization matrix optimization method; macroblock level; normalization factors; perceptual video coding; structural similarity index; Discrete cosine transforms; Indexes; Optimized production technology; Quantization; Video coding; Divisive normalization; H.264/AVC coding; perceptual video coding; rate distortion optimization; structural similarity (SSIM) index;
fLanguage
English
Journal_Title
Image Processing, IEEE Transactions on
Publisher
ieee
ISSN
1057-7149
Type
jour
DOI
10.1109/TIP.2012.2231090
Filename
6373724
Link To Document