• DocumentCode
    2981407
  • Title

    A Parallel H.264 Encoder with CUDA: Mapping and Evaluation

  • Author

    Nan Wu ; Mei Wen ; Huayou Su ; Ju Ren ; Chunyuan Zhang

  • Author_Institution
    Comput. Sch., Nat. Univ. of Defense Technol., Changsha, China
  • fYear
    2012
  • fDate
    17-19 Dec. 2012
  • Firstpage
    276
  • Lastpage
    283
  • Abstract
    Efficient mapping of a real-time HD video application to graphics hardware is challenging. Developers face the challenges of choosing the right parallelism model, balancing thread´s process granularity between massive computing resources on the GPU, and partitioning tasks between the CPU and GPU. The paper illustrated the mapping approaches by a case of HD H.264 encoder based on X264 reference code and then evaluating it on state-of-the-art CPU and GPUs in depth. In the paper, we first split most of the computing task into Single-Instruction Multiple-Thread (SIMT) kernels, which are then chained intocertaininput/output data stream. Then we implementeda completed H.264 encoding on the computer unified device architecture (CUDA) platform. Finally, we present methods for exploiting multi-level parallelism and memory efficiency when mapping H.264 code, which we use to increase the efficiency of the execution on GPUs. Our experimental results show that computation efficiency of GPU and then real-time encoding performance are achieved with CUDA.
  • Keywords
    graphics processing units; parallel architectures; parallel processing; CPU; CUDA; GPU; SIMT; X264 reference code; computer unified device architecture; graphics hardware; memory efficiency; multilevel parallelism; parallel H.264 encoder; real-time HD video application; single instruction multiple thread kernels; thread process; Encoding; Filtering; Graphics processing units; Instruction sets; Kernel; Parallel processing; Streaming media; GPU Programming; High-performance Media Computing; Parallel Algorithm; Real Time Encode; Video Processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems (ICPADS), 2012 IEEE 18th International Conference on
  • Conference_Location
    Singapore
  • ISSN
    1521-9097
  • Print_ISBN
    978-1-4673-4565-1
  • Electronic_ISBN
    1521-9097
  • Type

    conf

  • DOI
    10.1109/ICPADS.2012.46
  • Filename
    6413686