Title :
Implementation and optimization of intra prediction in H264 video parallel decoder on CUDA
Author :
Bocheng Liu ; Qingkui Chen
Author_Institution :
Univ. of Shanghai for Sci. & Technol., Shanghai, China
Abstract :
To analyze the quality of the large number of H264 videos in the 3G network, we set up a GPU cluster to decode multiple videos on CUDA and evaluated the clarity of the decoded frames. This paper focuses on parallel intra prediction and its optimization. By improving the parallel algorithm, adjusting the data structure, and making rational use of the GPU's multilevel memories, we reduce execution time by an average of 63.8% compared with the original algorithm.
Keywords :
3G mobile communication; decoding; graphics processing units; parallel architectures; video coding; 3G network; CUDA; GPU; H264 video parallel decoder; H264 videos; data structure; multilevel memories; parallel algorithm; parallel intra prediction; Decoding; Graphics processing units; Instruction sets; Memory management; Optimization; Prediction algorithms; Registers;
Conference_Title :
Advanced Computational Intelligence (ICACI), 2012 IEEE Fifth International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4673-1743-6
DOI :
10.1109/ICACI.2012.6463133