Title :
CUDA-based acceleration of the JPEG decoder
Author :
Ke Yan ; Junming Shan ; Eryan Yang
Author_Institution :
Key Lab. of Specialty, Fiber Opt. & Opt. Access Networks, Shanghai Univ., Shanghai, China
Abstract :
In this paper, an accelerated JPEG (Joint Photographic Experts Group) decoder was efficiently implemented on the GPU (Graphic Processing Unit) using the CUDA (Computer Unified Device Architecture) technology and it is capable for high definition images decoding. The CUDA technology can assist the GPU to work for the CPU for large computation. In this paper, the IDCT(Inverse Discrete Cosine Transform) model which has consumed about 75% the total time in the JPEG decoder is worked in the GPU by the CUDA, and other models of the JPEG decoder are designed in the CPU. At the same time, we use the asynchronous parallel execution between the CPU and the GPU to improve the JPEG decoder acceleration rate. In the experiment, the JPEG decoder based on the CUDA performs decompression of 3240 × 2160 pixels images, the implementation of the CUDA-based IDCT can be more than 49 times faster than the implementation on the CPU and the total processing time of the whole JPEG decoder can save about 50% time than the CPU.
Keywords :
decoding; discrete cosine transforms; graphics processing units; inverse transforms; parallel architectures; video coding; CUDA-based acceleration; GPU; IDCT; JPEG decoder; asynchronous parallel execution; graphic processing unit; inverse discrete cosine transform; joint photographic experts group decoder; Computational modeling; Data models; Decoding; Graphics processing units; Instruction sets; Registers; Transform coding;
Conference_Titel :
Natural Computation (ICNC), 2013 Ninth International Conference on
Conference_Location :
Shenyang
DOI :
10.1109/ICNC.2013.6818183