Title :
Exploiting Multi- and Many-core Parallelism for Accelerating Image Compression
Author :
Kim, Cheong Ghil ; Choi, Yong Soo
Author_Institution :
Dept. of Comput. Sci., Namseoul Univ., Cheonan, South Korea
Abstract :
With the help of recent development on semiconductor design and process technologies modern processors can provide a great opportunity to increase the performance of processing multimedia data by exploiting task- and data-parallelism in heterogeneous system consisting of multi-core CPUs and many-core GPUs (graphical processing units). This paper presents an optimization of 2D DCT (discrete cosine transform), a computation intensive signal processing algorithm widely used in compression standards, on speed in multi-core CPUs and many-core GPUs. Two optimization techniques using Intel TBB (threading building blocks) and Open CL are discussed in detail. Open CL is a recent open standard proposed to provide universal APIs and programming paradigms for various GPUs and accelerators, it can provide massively parallel processing suitable for data intensive multimedia applications with very low cost. The simulation result that the parallel DCT implementations are performed considerably faster than the serial ones, max 4.8 and 6.9 times speedup as for TBB and Open CL, respectively. Especially, Open CL implementation on GPU shows a linear speedup, a typical characteristic of massively parallel processing, as the increase of 2D data sets.
Keywords :
computer graphic equipment; coprocessors; data compression; discrete cosine transforms; image coding; multimedia computing; multiprocessing systems; optimisation; parallel processing; 2D data sets; Intel TBB; OpenCL; data intensive multimedia application; data parallelism; discrete cosine transform; graphical processing unit; heterogeneous system; image compression standard; intensive signal processing algorithm; manycore GPU; manycore parallelism; multicore CPU; multicore parallelism; multimedia data processing; optimization technique; parallel DCT implementation; parallel processing; process technology; programming paradigm; semiconductor design; task parallelism; universal API; Multimedia communication; DCT; GPU; OpenCL; TBB; massively parallel programming; task-level parallelsim;
Conference_Titel :
Multimedia and Ubiquitous Engineering (MUE), 2011 5th FTRA International Conference on
Conference_Location :
Loutraki
Print_ISBN :
978-1-4577-1228-9
Electronic_ISBN :
978-0-7695-4470-0
DOI :
10.1109/MUE.2011.13