DocumentCode
627127
Title
Speeding up the runtime performance for lossless image coding on GPUs with CUDA
Author
Lih-Jen Kau ; Chih-Shen Chen
Author_Institution
Dept. of Electron. Eng., Nat. Taipei Univ. of Technol., Taipei, Taiwan
fYear
2013
fDate
19-23 May 2013
Firstpage
2868
Lastpage
2871
Abstract
With the highly increased capability on parallel processing, computing on graphics processing units (GPUs) have been widely used in applications more than just graphics data processing. In this paper, we apply the compute unified device architecture (CUDA), a parallel computing architecture on GPUs proposed by NVIDIA, for the runtime performance enhancement in a predictively encoded lossless image compression system. For this, a least squares (LS)-adapted predictor, an effective approach for the removal of redundancy around boundaries, is applied. The adaptation process of an LS-based predictor requires multiplications of matrices for the construction of normal equations, which has been known to be the major complexity in LS adaptation process. Fortunately, matrices multiplication is most suitable to be parallel processed, which leads to the idea of speeding up the construction of normal equations with GPUs. With the proposed approach, a noticeable improvement on the runtime performance can be achieved as can be seen in the experiments.
Keywords
graphics processing units; image coding; least squares approximations; parallel architectures; CUDA; GPU; NVIDIA; compute unified device architecture; graphics processing units; least squares-adapted predictor; lossless image coding; runtime performance; Encoding; Equations; Graphics processing units; Image coding; Image edge detection; Mathematical model; Runtime;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems (ISCAS), 2013 IEEE International Symposium on
Conference_Location
Beijing
ISSN
0271-4302
Print_ISBN
978-1-4673-5760-9
Type
conf
DOI
10.1109/ISCAS.2013.6572477
Filename
6572477
Link To Document