Title : 
Anisotropic nonlinear diffusion for filtering 3D images on GPUs
         
        
            Author : 
Tabik, Siham ; Murarasu, Alin ; Romero, Luis F.
         
        
            Author_Institution : 
Dept. of Comput. Archit., Univ. of Malaga, Malaga, Spain
         
        
        
        
        
        
            Abstract : 
Optimizing sophisticated PDE-based filtering methods, such as the Anisotropic Nonlinear Diffusion (AND), to GPUs is complicated and time consuming. In this work, we expressed AND as iterative multiple 3D-stencils, where each 3D-stencil is implemented into one kernel, and then we analyzed all possible kernel fusions on the GPU. We experimentally found that fusing dependent stencils with similar concurrency and lower on-chip pressure makes the optimal combination run 1, 52× faster than the next better one.
         
        
            Keywords : 
filtering theory; graphics processing units; image processing; 3D image filtering; AND; GPU; PDE based filtering methods; anisotropic nonlinear diffusion; onchip pressure; Graphics processing units; Instruction sets; Kernel; Multicore processing; Optimization; Smoothing methods; Tensile stress;
         
        
        
        
            Conference_Titel : 
Cluster Computing (CLUSTER), 2014 IEEE International Conference on
         
        
            Conference_Location : 
Madrid
         
        
        
            DOI : 
10.1109/CLUSTER.2014.6968786