Title of article
A sparse octree gravitational N-body code that runs entirely on the GPU processor
Author/Authors
Bédorf, Jeroen and Gaburov, Evghenii and Portegies Zwart, Simon
Issue Information
Periodical with serial number, year 2012
Pages
15
From page
2825
To page
2839
Abstract
We present the implementation and performance of a new gravitational N-body tree-code that is specifically designed for the graphics processing unit (GPU). All parts of the tree-code algorithm are executed on the GPU. We present algorithms for parallel construction and traversal of sparse octrees. These algorithms are implemented in CUDA and tested on NVIDIA GPUs, but they are portable to OpenCL and can easily be used on many-core devices from other manufacturers. This portability is achieved by using general parallel-scan and sort methods. The gravitational tree-code outperforms tuned CPU code during the tree-construction and shows a performance improvement of more than a factor of 20 overall, resulting in a processing rate of more than 2.8 million particles per second. The code is publicly available at: http://castle.strw.leidenuniv.nl/software.html.
Keywords
hierarchical , GPU , parallel , Tree-code , N-body , Gravity
Journal title
Journal of Computational Physics
Serial Year
2012
Record number
1484248