Title :
Techniques for efficient DCT/IDCT implementation on generic GPU
Author :
Fang, Bo ; Shen, Guobin ; Li, Shipeng ; Chen, Huifang
Author_Institution :
Dept. of Inf. Sci. & Electron. Eng., Zhejiang Univ., Hangzhou, China
Abstract :
The emergence of programmable graphics processing units (GPU) has led to increasing interest in off-loading numerically intensive computations on to graphics hardware. DCT/IDCT is widely adopted in modern image/video compression standards and is usually one of the most computationally expensive parts. We present several techniques for efficient implementation of DCT/IDCT on generic programmable GPU, using direct matrix multiplication. Our experimental results demonstrate that the speed of IDCT on a GPU using the proposed techniques can well exceed that on a CPU with MMX optimization.
Keywords :
computer graphics; coprocessors; data compression; discrete cosine transforms; image coding; matrix multiplication; video coding; CPU; DCT; IDCT; MMX optimization; direct matrix multiplication; generic programmable graphics processing units; graphics hardware; image compression; numerically intensive computations; video compression; Asia; Central Processing Unit; Discrete cosine transforms; Graphics; Hardware; Information science; Internet; Kernel; Matrix decomposition; Transform coding;
Conference_Titel :
Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on
Print_ISBN :
0-7803-8834-8
DOI :
10.1109/ISCAS.2005.1464791