DocumentCode :
2277023
Title :
Perceptual video coding: Challenges and approaches
Author :
Chen, Zhenzhong ; Lin, Weisi ; Ngan, King Ngi
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2010
fDate :
19-23 July 2010
Firstpage :
784
Lastpage :
789
Abstract :
Investigation on the human perception can play an important role in video signal processing. Recently, there has been great interest in incorporating the human perception in video coding systems to enhance the perceptual quality of the represented visual signal. However, the limited understanding of the human visual system and high complexity of computational models of human visual system make it a challenging task. Furthermore, the hybrid video coding structure brings difficulties to integrate computational models with coding components to fulfill the requirements. In this paper, we review the physiological characteristics of human perception and address the most relevant aspects to video coding applications. Moreover, we discuss the computational models and metrics which guide the design and implementation of the video coding system, as well as the recent advances in perceptual video coding. To introduce this overview with the latest technologies and most promising directions in perceptual video coding, we focus on three key areas. Specifically, we cover 1) visual attention and sensitivity modeling, with which we concentrate on the computational models of bottom-up and top-down attention, contrast sensitivity functions and masking effects, and fovea based manipulations; 2) perceptual quality optimization for constrained video coding, with which we discuss how to achieve maximum perceptual quality whilst satisfying various constraints; and 3) the impact of the human perception on advanced video applications, including emerging immersive multimedia services, and compression of high dynamic range video content and 3D video. For each aspect, we discuss the major challenges, highlight significant approaches, and outline future research directions.
Keywords :
data compression; sensitivity analysis; video coding; 3D video; bottom-up attention; contrast sensitivity functions; fovea based manipulations; high dynamic range video content; human perception; human visual system; hybrid video coding structure; immersive multimedia services; masking effects; perceptual quality optimization; perceptual video coding; sensitivity modeling; top-down attention; video compression; video signal processing; visual attention; visual signal; Computational modeling; Encoding; Humans; Retina; Sensitivity; Video coding; Visualization; Perceptual video coding; human visual system; quality optimization; visual perception;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2010 IEEE International Conference on
Conference_Location :
Suntec City
ISSN :
1945-7871
Print_ISBN :
978-1-4244-7491-2
Type :
conf
DOI :
10.1109/ICME.2010.5582549
Filename :
5582549
Link To Document :
بازگشت