DocumentCode :
145395
Title :
A CUDA Based Implementation of Locally-and Feature-Adaptive Diffusion Based Image Denoising Algorithm
Author :
Yazdanpanah, Ali Pour ; Mandava, Ajay K. ; Regentova, Emma E. ; Muthukumar, Vishak ; Bebis, G.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Nevada, Las Vegas, NV, USA
fYear :
2014
fDate :
7-9 April 2014
Firstpage :
388
Lastpage :
393
Abstract :
In this paper we introduce a parallel implementation of locally-and feature-adaptive diffusion based (LFAD) method for image denoising using NVIDIA CUDA framework and graphics processing units (GPUs). LFAD is a novel method for removing additive white Gaussian (AWG) noise in images reported to yield high quality denoised images [1]. It approaches each image region separately and uses different number of nonlinear anisotropic diffusion iterations for each region to attain best peak signal to noise ratio (PSNR). The inverse difference moment (IDM) feature is embedded into a modified diffusion function. As the method has attained highest performance in the class of advanced diffusion based methods and it is competitive with all the state-of-the-art methods, however computationally intensive when executed on the general purpose CPU. To improve the performance, we implemented using the CUDA computational framework. In order to minimize GPU kernel access to the global memory, we use shared memory and the texture memory per multiprocessor. The performance of the GPU implementation of the LFAD has been tested on the standard benchmark images. We demonstrate that with a single NVIDIA Tesla C2050 GPU we can expedite the sequential CPU implementation in most cases from 13 to 20 times.
Keywords :
AWGN; graphics processing units; image denoising; parallel architectures; shared memory systems; AWG noise removal; CUDA based implementation; IDM; LFAD method; NVIDIA CUDA framework; NVIDIA Tesla C2050 GPU; PSNR; additive white Gaussian noise removal; feature-adaptive diffusion based image denoising algorithm; general purpose CPU; graphics processing units; high quality denoised images; inverse difference moment; locally-adaptive diffusion based image denoising algorithm; multiprocessor; nonlinear anisotropic diffusion iterations; peak signal to noise ratio; sequential CPU implementation; shared memory; texture memory; Computer architecture; Graphics processing units; Instruction sets; Kernel; Merging; Noise reduction; PSNR; CUDA Implementation; GPU; Image Denoising; LFAD; NVIDIA;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology: New Generations (ITNG), 2014 11th International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4799-3187-3
Type :
conf
DOI :
10.1109/ITNG.2014.113
Filename :
6822228
Link To Document :
بازگشت