Title :
Speeding up motion estimation algorithms on CUDA technology
Author :
Cheng, Ronghui ; Yang, Eryan ; Liu, Ting
Author_Institution :
Sch. of Commun. & Inf. Eng., Shanghai Univ., Shanghai, China
Abstract :
Motion estimation (ME) is one of the most important modules in digital video encoding/decoding and video post-processing. Many applications require ME algorithms that are both high-quality and fast. This paper explores the execution efficiency of ME algorithms on CUDA (Compute Unified Device Architecture), a parallel computing architecture developed by nVIDIA. We implemented the full search (FS) algorithm, the diamond search algorithm, and the four-step search algorithm on both the CPU and CUDA platforms. Experimental results show that the CUDA-based implementations of these ME algorithms can be more than 8 times as fast as the corresponding CPU-based implementations.
Keywords :
decoding; motion estimation; parallel architectures; search problems; video signal processing; compute unified device architecture technology; diamond search algorithm; digital video decoding; digital video encoding; full search algorithm; motion estimation algorithms; nVIDIA; parallel computing architecture; video post processing; Computer architecture; Graphics processing unit; Image edge detection; Instruction sets; Motion estimation; Pixel; Registers;
Conference_Title :
Microelectronics and Electronics (PrimeAsia), 2010 Asia Pacific Conference on Postgraduate Research in
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-6735-8
Electronic_ISBN :
978-1-4244-6736-5
DOI :
10.1109/PRIMEASIA.2010.5604952