Title :
Fast GPU parallel N-Body tree traversal with Simulated Wide-Warp
Author :
Nunan Zola, Wagner M. ; Bona, Luis C. E. ; Silva, Fabiano
Author_Institution :
Fed. Univ. of Parana, Curitiba, Brazil
Abstract :
The Barnes-Hut algorithm is a widely used approximation method for the N-Body simulation problem. The irregular nature of this tree walking code presents interesting challenges for its computation on parallel systems. Additional problems arise in effectively exploiting the processing capacity of GPU architectures. We propose and investigate the applicability of software Simulated Wide-Warps (SWW) in this context. To this extent, we explicitly deal with dynamic irregular patterns in data accesses with data remapping and data transformation, by controlling execution flow divergence of threads. We present a new compact data-structure for the tree layout, GPU parallel algorithms for tree transformation and parallel walking using SWW. Benefits of our techniques are in transposing the tree algorithm to execute regular patterns to match the GPU model. Our experiments show significant performance improvement over the best known GPU solutions to this algorithm.
Keywords :
graphics processing units; parallel algorithms; Barnes-Hut algorithm; GPU architecture; GPU parallel algorithm; SWW software; approximation method; data access; data remapping; data transformation; execution flow divergence; fast GPU parallel n-body tree traversal; graphics processing unit; n-body simulation problem; parallel system; simulated wide-warp software; Graphics processing units; Indexes; Instruction sets; Kernel; Layout; Octrees; Registers; Accelerator Computing; Barnes-Hut; CUDA; GPGPU; Implicit Octree; Manycore Computing; N-Body; Parallel algorithms; Software Simulated Wide-Warp;
Conference_Titel :
Parallel and Distributed Systems (ICPADS), 2014 20th IEEE International Conference on
DOI :
10.1109/PADSW.2014.7097874