Title :
A Parallel H.264 Encoder with CUDA: Mapping and Evaluation
Author :
Nan Wu ; Mei Wen ; Huayou Su ; Ju Ren ; Chunyuan Zhang
Author_Institution :
Comput. Sch., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
Efficient mapping of a real-time HD video application to graphics hardware is challenging. Developers face the challenges of choosing the right parallelism model, balancing thread´s process granularity between massive computing resources on the GPU, and partitioning tasks between the CPU and GPU. The paper illustrated the mapping approaches by a case of HD H.264 encoder based on X264 reference code and then evaluating it on state-of-the-art CPU and GPUs in depth. In the paper, we first split most of the computing task into Single-Instruction Multiple-Thread (SIMT) kernels, which are then chained intocertaininput/output data stream. Then we implementeda completed H.264 encoding on the computer unified device architecture (CUDA) platform. Finally, we present methods for exploiting multi-level parallelism and memory efficiency when mapping H.264 code, which we use to increase the efficiency of the execution on GPUs. Our experimental results show that computation efficiency of GPU and then real-time encoding performance are achieved with CUDA.
Keywords :
graphics processing units; parallel architectures; parallel processing; CPU; CUDA; GPU; SIMT; X264 reference code; computer unified device architecture; graphics hardware; memory efficiency; multilevel parallelism; parallel H.264 encoder; real-time HD video application; single instruction multiple thread kernels; thread process; Encoding; Filtering; Graphics processing units; Instruction sets; Kernel; Parallel processing; Streaming media; GPU Programming; High-performance Media Computing; Parallel Algorithm; Real Time Encode; Video Processing;
Conference_Titel :
Parallel and Distributed Systems (ICPADS), 2012 IEEE 18th International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4673-4565-1
Electronic_ISBN :
1521-9097
DOI :
10.1109/ICPADS.2012.46