DocumentCode :
41931
Title :
Parallel H.264/AVC Fast Rate-Distortion Optimized Motion Estimation by Using a Graphics Processing Unit and Dedicated Hardware
Author :
Shahid, Muhammad Usman ; Ahmed, Ashfaq ; Martina, Maurizio ; Masera, Guido ; Magli, Enrico
Author_Institution :
Politec. di Torino, Turin, Italy
Volume :
25
Issue :
4
fYear :
2015
fDate :
Apr-15
Firstpage :
701
Lastpage :
715
Abstract :
Heterogeneous systems on a single chip composed of a central processing unit, graphics processing unit (GPU), and field-programmable gate array (FPGA) are expected to emerge in the near future. In this context, the system on chip can be dynamically adapted to employ different architectures for execution of data-intensive applications. Motion estimation (ME) is one such task that can be accelerated using FPGA and GPU for high-performance H.264/Advanced Video Coding encoder implementation. This paper presents an inherent parallel low-complexity rate-distortion (RD) optimized fast ME algorithm well suited for parallel implementations, eliminating various data dependencies caused by a reliance on spatial predictions. In addition, this paper provides details of the GPU and FPGA implementations of the parallel algorithm by using OpenCL and Very High Speed Integrated Circuits (VHSIC) Hardware Descriptive Language (VHDL), respectively, and presents a practical performance comparison between the two implementations. The experimental results show that the proposed scheme achieves significant speedup on GPU and FPGA, and has comparable RD performance with respect to sequential fast ME algorithm.
Keywords :
field programmable gate arrays; graphics processing units; hardware description languages; motion estimation; rate distortion theory; system-on-chip; very high speed integrated circuits; video coding; FPGA; OpenCL; VHDL; VHSIC; advanced video coding; central processing unit; data intensive applications; dedicated hardware; field programmable gate array; graphics processing unit; hardware descriptive language; heterogeneous systems; inherent parallel low complexity; motion estimation; parallel H.264-AVC; parallel implementations; rate distortion; spatial predictions; system on chip; very high speed integrated circuits; Algorithm design and analysis; Computer architecture; Field programmable gate arrays; Graphics processing units; Motion estimation; Prediction algorithms; Video coding; Field-programmable gate array (FPGA); H.264/Advanced Video Coding (AVC); OpenCL; graphics processing unit (GPU); parallel fast motion estimation (ME);
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
ieee
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/TCSVT.2014.2351111
Filename :
6882206
Link To Document :
بازگشت