• DocumentCode
    52758
  • Title

    Adaptive Inter-Mode Decision for HEVC Jointly Utilizing Inter-Level and Spatiotemporal Correlations

  • Author

    Liquan Shen ; Zhaoyang Zhang ; Zhi Liu

  • Author_Institution
    Key Lab. of Adv. Display & Syst. Applic., Shanghai Univ., Shanghai, China
  • Volume
    24
  • Issue
    10
  • fYear
    2014
  • fDate
    Oct. 2014
  • Firstpage
    1709
  • Lastpage
    1722
  • Abstract
    High Efficiency Video Coding (HEVC) adopts the quadtree structured coding unit (CU), which allows recursive splitting into four equally sized blocks. At each depth level, it enables SKIP mode, merge mode, inter 2N × 2N, inter 2N × N, inter N × 2N, inter 2N × nU, inter 2N × nD, inter nL x 2N, inter nR × 2N, inter N × N (only available for the smallest CU), intra 2N × 2N, and intra N × N (only available for the smallest CU) in inter-frames. Similar to H.264/AVC, the mode decision process in HEVC is performed using all the possible depth levels (or CU sizes) and prediction modes to find the one with the least rate distortion (RD) cost using Lagrange multiplier. This achieves the highest coding efficiency, but leads to a very high computational complexity. Since the optimal prediction mode is highly content dependent, it is not efficient to use all the modes. In this paper, we propose a fast inter-mode decision algorithm for HEVC by jointly using the inter-level correlation of quadtree structure and the spatiotemporal correlation. There exist strong correlations of the prediction mode, the motion vector and RD cost between different depth levels and between spatially temporally adjacent CUs. We statistically analyze the prediction mode distribution at each depth level and the coding information correlation among the adjacent CUs. Based on the analysis results, three adaptive inter-mode decision strategies are proposed including early SKIP mode decision, prediction size correlation-based mode decision and RD cost correlation-based mode decision. Experimental results show that the proposed overall algorithm can save 49%-52% computational complexity on average with negligible loss of coding efficiency, exhibiting applicability to various types of video sequences.
  • Keywords
    computational complexity; image motion analysis; prediction theory; quadtrees; rate distortion theory; spatiotemporal phenomena; video coding; HEVC; Lagrange multiplier; SKIP mode; adaptive inter-mode decision; coding efficiency; coding information correlation; computational complexity; high efficiency video coding; inter-level correlation; least rate distortion; merge mode; optimal prediction mode; prediction mode distribution; quadtree structured coding unit; spatiotemporal correlation; spatiotemporal correlations; video sequences; Algorithm design and analysis; Correlation; Electronic mail; Encoding; Materials; Prediction algorithms; Video coding; High Efficiency Video Coding (HEVC); mode decision; motion estimation (ME);
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2014.2313892
  • Filename
    6778776