DocumentCode
2140206
Title
CUDA-based acceleration of the JPEG decoder
Author
Ke Yan ; Junming Shan ; Eryan Yang
Author_Institution
Key Lab. of Specialty, Fiber Opt. & Opt. Access Networks, Shanghai Univ., Shanghai, China
fYear
2013
fDate
23-25 July 2013
Firstpage
1319
Lastpage
1323
Abstract
In this paper, an accelerated JPEG (Joint Photographic Experts Group) decoder was efficiently implemented on the GPU (Graphic Processing Unit) using the CUDA (Computer Unified Device Architecture) technology and it is capable for high definition images decoding. The CUDA technology can assist the GPU to work for the CPU for large computation. In this paper, the IDCT(Inverse Discrete Cosine Transform) model which has consumed about 75% the total time in the JPEG decoder is worked in the GPU by the CUDA, and other models of the JPEG decoder are designed in the CPU. At the same time, we use the asynchronous parallel execution between the CPU and the GPU to improve the JPEG decoder acceleration rate. In the experiment, the JPEG decoder based on the CUDA performs decompression of 3240 × 2160 pixels images, the implementation of the CUDA-based IDCT can be more than 49 times faster than the implementation on the CPU and the total processing time of the whole JPEG decoder can save about 50% time than the CPU.
Keywords
decoding; discrete cosine transforms; graphics processing units; inverse transforms; parallel architectures; video coding; CUDA-based acceleration; GPU; IDCT; JPEG decoder; asynchronous parallel execution; graphic processing unit; inverse discrete cosine transform; joint photographic experts group decoder; Computational modeling; Data models; Decoding; Graphics processing units; Instruction sets; Registers; Transform coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Computation (ICNC), 2013 Ninth International Conference on
Conference_Location
Shenyang
Type
conf
DOI
10.1109/ICNC.2013.6818183
Filename
6818183
Link To Document