Title :
Scalable Video Coding Deblocking Filter FPGA and ASIC Implementation Using High-Level Synthesis Methodology
Author :
Carballo, Pedro P. ; Espino, Omar ; Neris, Romen ; Hernandez-Fernandez, Pedro ; Szydzik, Tomasz M. ; Nunez, A.
Author_Institution :
Inst. for Appl. Microelectron. (IUMA), Univ. Las Palmas de Gran Canaria, Las Palmas de Gran Canaria, Spain
Abstract :
This paper describes key concepts in the design and implementation of a deblocking filter (DF) for a H.264/SVC video decoder. The DF supports QCIF and CIF video formats with temporal and spatial scalability. The design flow starts from a SystemC functional model and has been refined using high-level synthesis methodology to RTL micro architecture. The process is guided with performance measurements (latency, cycle time, power, resource utilization) with the objective of assuring the quality of results of the final system. The functional model of the DF is created in an incremental way from the AVC DF model using OpenSVC source code as reference. The design flow continues with the logic synthesis and the implementation on the FPGA using various strategies. The FPGA implementation is capable to run at 100 MHz, and macro blocks are processed in 6, 500 clock cycles for a throughput of 130 fps for QCIF format and 37 fps for CIF format. A validation platform has been developed using the embedded PowerPC processor in the FPGA, composing a SoC that integrates the tasks for frame generation and visualization on a TFT screen. The FPGA implements both the DF core and a General Purpose Memory Controller (GPMC) slave core. Both cores are connected to the PowerPC440 embedded processor using Local Link interfaces. The FPGA also contains a local memory capable of storing information necessary to filter a complete frame and to store a decoded picture frame. The complete system is implemented in a Virtex5 FX70T device. An ASIC implementation of the deblocking filter has been done using UMC CMOS 65nm technology. The ASIC implementation is running at 181.8 MHz, occupying an area of 596, 392.4 μm2.
Keywords :
DRAM chips; embedded systems; field programmable gate arrays; filtering theory; high level synthesis; performance evaluation; power aware computing; resource allocation; source coding; system-on-chip; user interfaces; video codecs; video coding; ASIC implementation; AVC DF model; CIF video formats; DF core; FPGA implementation; GPMC slave core; H.264-SVC video decoder; LocalLink interfaces; OpenSVC source code; PowerPC440 embedded processor; QCIF video formats; RTL microarchitecture; SoC; SystemC functional model; TFT screen; UMC CMOS technology; Virtex5 FX70T device; clock cycles; cycle time; design flow; embedded PowerPC processor; frame generation; frame visualization; frequency 100 MHz; general purpose memory controller slave core; high-level synthesis methodology; information storage; latency; local memory; logic synthesis; macroblock processing; performance measurements; power utilization; resource utilization; scalable video coding deblocking filter; size 65 nm; spatial scalability; temporal scalability; Decoding; Field programmable gate arrays; Hardware; Information filtering; Random access memory; Scalability; ASIC; Deblocking filter; FPGA; H.264/SVC; SystemC; hardware accelerator; high-level synthesis;
Conference_Titel :
Digital System Design (DSD), 2013 Euromicro Conference on
Conference_Location :
Los Alamitos, CA
DOI :
10.1109/DSD.2013.52