• DocumentCode
    2868071
  • Title

    Exploiting Multi- and Many-core Parallelism for Accelerating Image Compression

  • Author

    Kim, Cheong Ghil ; Choi, Yong Soo

  • Author_Institution
    Dept. of Comput. Sci., Namseoul Univ., Cheonan, South Korea
  • fYear
    2011
  • fDate
    28-30 June 2011
  • Firstpage
    12
  • Lastpage
    17
  • Abstract
    With the help of recent development on semiconductor design and process technologies modern processors can provide a great opportunity to increase the performance of processing multimedia data by exploiting task- and data-parallelism in heterogeneous system consisting of multi-core CPUs and many-core GPUs (graphical processing units). This paper presents an optimization of 2D DCT (discrete cosine transform), a computation intensive signal processing algorithm widely used in compression standards, on speed in multi-core CPUs and many-core GPUs. Two optimization techniques using Intel TBB (threading building blocks) and Open CL are discussed in detail. Open CL is a recent open standard proposed to provide universal APIs and programming paradigms for various GPUs and accelerators, it can provide massively parallel processing suitable for data intensive multimedia applications with very low cost. The simulation result that the parallel DCT implementations are performed considerably faster than the serial ones, max 4.8 and 6.9 times speedup as for TBB and Open CL, respectively. Especially, Open CL implementation on GPU shows a linear speedup, a typical characteristic of massively parallel processing, as the increase of 2D data sets.
  • Keywords
    computer graphic equipment; coprocessors; data compression; discrete cosine transforms; image coding; multimedia computing; multiprocessing systems; optimisation; parallel processing; 2D data sets; Intel TBB; OpenCL; data intensive multimedia application; data parallelism; discrete cosine transform; graphical processing unit; heterogeneous system; image compression standard; intensive signal processing algorithm; manycore GPU; manycore parallelism; multicore CPU; multicore parallelism; multimedia data processing; optimization technique; parallel DCT implementation; parallel processing; process technology; programming paradigm; semiconductor design; task parallelism; universal API; Multimedia communication; DCT; GPU; OpenCL; TBB; massively parallel programming; task-level parallelsim;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Ubiquitous Engineering (MUE), 2011 5th FTRA International Conference on
  • Conference_Location
    Loutraki
  • Print_ISBN
    978-1-4577-1228-9
  • Electronic_ISBN
    978-0-7695-4470-0
  • Type

    conf

  • DOI
    10.1109/MUE.2011.13
  • Filename
    5992185