Title :
An optimized hardware architecture for intra prediction in H.264 decoder
Author :
Qi Wang ; Quanquan Li ; Shi Chen ; Tiejun Zhang ; Chaohuan Hou
Author_Institution :
Digital Syst. Integration Lab., Inst. of Acoust., Beijing, China
Abstract :
The advanced video coding standard H.264 is widely used to compress video data for applications such as Digital Still Camera (DSC), Digital Video Camera (DVC), Television Studio Broadcast, and Surveillance Video. Intra prediction is one of the most critical issues in H.264 decoding in terms of processing cycles and computation complexity, it is the system bottleneck of real-time H.264 decoding. This paper presents an area optimized memory update mechanism. For luma 4×4 block prediction, the proposed architecture only adds four registers to store the corner pixels based on luma 16×16 block prediction. A high efficient 4×4 block level pipeline is also presented in this paper. The pipelined architecture eliminates data dependencies between two neighboring blocks thoroughly by inserting chroma 4×4 block processing into luma 4×4 block processing. The proposed architecture is verified to work at 160MHz in a Xilinx Virtex-II pro FPGA. It costs about 23k gates by using SIMC 90nm technology and supports real-time decoding with 1080p format video in 30fps.
Keywords :
computational complexity; data compression; image sensors; video coding; DSC; H.264 decoding; Xilinx Virtex-II pro FPGA; block prediction; computation complexity; corner pixels; digital still camera; digital video camera; intra prediction; optimized hardware architecture; pipelined architecture; television studio broadcast; video coding standard H.264; video data compression; video surveillance; Computer architecture; Decoding; Hardware; Pipelines; Random access memory; Registers; Streaming media; H.264; Intra prediction; decoder;
Conference_Titel :
ASIC (ASICON), 2013 IEEE 10th International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4673-6415-7
DOI :
10.1109/ASICON.2013.6811847