DocumentCode
3442158
Title
H.264 visual perceptual coding in uniform analyzing and encoding framework
Author
Zheng, Yayu ; Zhu, Wei ; Chen, Peng
fYear
2012
fDate
16-18 Oct. 2012
Firstpage
29
Lastpage
37
Abstract
The perception of Human Visual System (HVS) for the video scene is selective. Different regions in the video scene have distinctive levels of visual importance. In this study, we present a novel H.264 visual perceptual video coding (VPVC) method in a uniform analyzing and encoding framework, which can allocate bit and computation resources effectively. The framework presented in this work consists of a visual perception model, a H.264 perceptual encoder and a corresponding sharing channel of information. The visual perception model, composed of motion perception, texture perception and spatial position perception sub-models, can compute the visual perception map (VPM) by fusing these spatiotemporal visual features. Visual perception results of HVS for various regions can be simulated well by VPM. The side encoding information of H.264 encoder, including motion vectors (MVs) and sum of absolute differences (SADs), is applied as input features for motion perception sub-model. A novel VPVC method is proposed based on the VPM and the global motion type of video scene. Using an adaptive frequency coefficient suppression technique and a novel encoding strategy, the optimal bit resource allocation is achieved by classifying video scene based on the VPM. In order to allocate computation resource effectively in VPVC method, the relation between optimal encoding mode and image features at video scene level is experimentally analyzed. As a result, a fast and effective H.264 mode analysis algorithm is deduced. When compared with the conventional H.264 coding method, our results on four video sequences show that the proposed method can obtain a high PSNR gain up to about 2.0 dB for visual important regions and decrease about 38% of total encoding time on average.
Keywords
H.264; ROI-based coding; human visual system; visual perception;
fLanguage
English
Publisher
ieee
Conference_Titel
Image and Signal Processing (CISP), 2012 5th International Congress on
Conference_Location
Chongqing, Sichuan, China
Print_ISBN
978-1-4673-0965-3
Type
conf
DOI
10.1109/CISP.2012.6469642
Filename
6469642
Link To Document