• DocumentCode
    2453635
  • Title

    A high-throughput, area-efficient hardware accelerator for adaptive deblocking filter in H.264/AVC

  • Author

    Nadeem, Muhammad ; Wong, Stephan ; Kuzmanov, Georgi ; Shabbir, Ahsan

  • Author_Institution
    Delft Univ. of Technol., Delft, Netherlands
  • fYear
    2009
  • fDate
    15-16 Oct. 2009
  • Firstpage
    18
  • Lastpage
    27
  • Abstract
    In this paper, we present a high-throughput, area-efficient, hardware accelerator for the deblocking filter in H.264/AVC video compression standard. In order to achieve this goal, we start with algorithmic optimization and propose a novel decomposition of the filter kernels for the deblocking filter. The proposed decomposition reduces the number of adders by 51% and thereby greatly reduces the area requirement for its implementation. Subsequently, at architecture level, while using two identical filtering units, the transpose units are realized by efficient reuse of hardware resources to further reduce the area requirement. The two filtering units process the horizontal and vertical edges of the macro-block simultaneously and therefore further enhance the throughput of the hardware accelerator. Several other optimization techniques, such as reuse of intermediate results, pipelining, and merging of processing blocks on critical path, result in a hardware accelerator for deblocking filter with high throughput at one hand and less area in terms of equivalent gates count on the other, when compared with existing state-of-the-art hardware accelerators in the literature. While working at clock frequency of 166 MHz, synthesized under 0.18 ¿m CMOS standard cell technology, it easily meets the throughput requirements of all the levels in H.264/AVC video coding standard and consumes only 12.06 K gates (excluding SRAM).
  • Keywords
    CMOS logic circuits; adaptive filters; cellular arrays; data compression; logic arrays; video coding; CMOS standard cell technology; H.264/AVC; adaptive deblocking filter; algorithmic optimization; area requirement reduction; clock frequency; filter kernels decomposition; frequency 166 MHz; hardware accelerator; hardware resources reuse; size 0.18 mum; transpose units; video compression standard; Adaptive filters; Automatic voltage control; CMOS technology; Filtering; Hardware; Kernel; Merging; Pipeline processing; Throughput; Video compression;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Embedded Systems for Real-Time Multimedia, 2009. ESTIMedia 2009. IEEE/ACM/IFIP 7th Workshop on
  • Conference_Location
    Grenoble
  • Print_ISBN
    978-1-4244-5169-2
  • Electronic_ISBN
    978-1-4244-5170-8
  • Type

    conf

  • DOI
    10.1109/ESTMED.2009.5336814
  • Filename
    5336814