DocumentCode :
3470554
Title :
A Study of the Performance of Multifluid PPM Gas Dynamics on CPUs and GPUs
Author :
Pei-Hung Lin ; Jayaraj, Jagan ; Woodward, Paul R.
Author_Institution :
Lab. for Comput. Sci. & Eng., Univ. of Minnesota, Minneapolis, MN, USA
fYear :
2011
fDate :
19-21 July 2011
Firstpage :
42
Lastpage :
51
Abstract :
The potential for GPUs and many-core CPUs to support high performance computation in the area of computational fluid dynamics (CFD) is explored quantitatively through the example of the PPM gas dynamics code with PPB multi fluid volume fraction advection. This code has already been implemented on the IBM Cell processor and run at full scale on the Los Alamos Roadrunner machine. This implementation has involved a complete restructuring of the code that has been described in detail elsewhere. Here the lessons learned from that work are exploited to take advantage oftoday´s latest generations of multi-core CPUs and many-core GPUs. The operations performed by this code are characterized in detail after being first decomposed into a series of individual code kernels to allow an implementation on GPUs. Careful implementations of this code for both CPUs and GPU sare then contrasted from a performance point of view. In addition, a single kernel that has many of the characteristics of the full application on CPUs has been built into a full, standalone, scalable parallel application. This single-kernel application shows the GPU at its best. In contrast, the full multi fluid gas dynamics application brings into play computational requirements that highlight the essential differences in CPU and GPU designs today and the different programming strategies needed to achieve the best performance for applications of this type on the two devices. The single kernel application code performs extremely well on both platforms. This application is not limited by main memory bandwidth on either device instead it is limited only by the computational capability of each. In this case, the GPU has the advantage, because it has more computational cores. The full multi fluid gas dynamics code is, however, of necessity memory bandwidth limited on the GPU, while it is still computational capability limited on the CPU. We believe that these codes provide a useful context for quantifying the costs a- - nd benefits of design decisions for these powerful new computing devices. Suggestions for improvements in both devices and codes based upon this work are offered in our conclusions.
Keywords :
computational fluid dynamics; computer graphic equipment; coprocessors; multiprocessing systems; CFD; GPU; IBM Cell processor; Los Alamos Roadrunner machine; PPB multifluid volume fraction advection; code kernels; computational capability; computational fluid dynamics; many-core CPU; memory bandwidth; multicore CPU; multifluid PPM gas dynamics; Arrays; Engines; Graphics processing unit; Kernel; Message systems; Performance evaluation; Workstations; GPGPU; computational fluid dynamics; exascale computing; high-performance computing; multicore CPU; parallel programming; source-to-source transformation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Application Accelerators in High-Performance Computing (SAAHPC), 2011 Symposium on
Conference_Location :
Knoxville, TN
Print_ISBN :
978-1-4577-0635-6
Electronic_ISBN :
978-0-7695-4448-9
Type :
conf
DOI :
10.1109/SAAHPC.2011.27
Filename :
6031562
Link To Document :
بازگشت