Title :
Spatially Scalable Video Coding for HEVC
Author :
Zhongbo Shi ; Xiaoyan Sun ; Feng Wu
Author_Institution :
Microsoft Research Asia, Beijing, China
Abstract :
Spatially scalable video coding (SSVC) provides an efficient way to transmit one video at different resolutions. Based on the emerging High Efficiency Video Coding (HEVC) standard, we propose an SSVC scheme that supports both single-loop (SL) and multiloop (ML) solutions by enabling different interlayer prediction mechanisms. Specifically, we employ two interlayer prediction modes: a quadtree-based prediction mode (Q-mode) and a learning-based prediction mode (L-mode). Q-mode exploits the interlayer redundancy through the quadtree coding structure of HEVC. Because the layers are highly correlated, Q-mode uses the coded information from the base layer quadtree, including the coding unit split, prediction unit partition, motion information, and partial texture information of the transform unit, to predict the enhancement layer quadtree. Enabling Q-mode alone provides a basic SL solution for low-complexity applications. Beyond the correlation exploited in Q-mode, we employ an additional L-mode to further improve coding performance. In L-mode, temporal and spatial correlations are exploited simultaneously by visual patch-based learning and mapping at the pixel level. This yields more accurate prediction signals from the coarse base layer reconstruction within an ML structure. Experimental results show the effectiveness of our SSVC scheme compared with the simulcast case and other HEVC-based SSVC schemes.
Keywords :
image texture; learning (artificial intelligence); quadtrees; video coding; HEVC-based SSVC schemes; L-mode; ML solutions; Q-mode; SL solutions; base layer quadtree; coarse base layer reconstruction; coding unit split; enhancement layer quadtree; high efficiency video coding; interlayer prediction mechanisms; learning-based prediction mode; low complexity applications; motion information; multiloop solutions; partial texture information; pixel level; prediction signals; prediction unit partition; quadtree-based prediction mode; single-loop solutions; spatially scalable video coding; temporal-spatial correlation; transform unit; visual mapping; visual patch-based learning; Correlation; Encoding; Scalability; Spatial resolution; Standards; Video coding; High Efficiency Video Coding (HEVC); learning-based approach; scalable video coding (SVC)
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
DOI :
10.1109/TCSVT.2012.2223031