Title :
GPU Implementation of Stony Brook University 5-Class Cloud Microphysics Scheme in the WRF
Author :
Mielikainen, Jarno ; Huang, Bormin ; Huang, Hung-Lung Allen ; Goldberg, Mitchell D.
Author_Institution :
Space Sci. & Eng. Center, Univ. of Wisconsin, Madison, WI, USA
fDate :
4/1/2012 12:00:00 AM
Abstract :
The Weather Research and Forecasting (WRF) model is a next-generation mesoscale numerical weather prediction system. It is designed to serve the needs of both operational forecasting and atmospheric research for a broad spectrum of applications across scales ranging from meters to thousands of kilometers. Microphysics plays an important role in weather and climate prediction. Microphysics includes explicitly resolved water vapor, cloud, and precipitation processes. Several bulk water microphysics schemes are available within the WRF, with different numbers of simulated hydrometeor classes and methods for estimating their size, fall speeds, distributions and densities. Stony Brook University scheme is a 5-class scheme with riming intensity predicted to account for the mixed-phase processes. In this paper, we develop an efficient Graphics Processing Unit (GPU) based Stony Brook University scheme. The GPU-based Stony Brook University scheme was compared to a CPU-based single-threaded counterpart on a computational domain of 422 × 297 horizontal grid points with 34 vertical levels. The original Fortran code was first rewritten into a standard C code. After that, C code was verified against Fortran code and CUDA C extensions were added for data parallel execution on GPUs. On a single GPU, we achieved a speed-up of 213× with data I/O and 896 × without I/O on NVIDIA GTX 590. Using multiple GPUs, a speed-up of 352 × is achieved with I/O for 4 GPUs. We will also discuss how data I/O will be less cumbersome if we ran the complete WRF model on GPUs.
Keywords :
FORTRAN; atmospheric humidity; atmospheric precipitation; atmospheric techniques; clouds; geophysics computing; graphics processing units; parallel architectures; weather forecasting; 5-class cloud microphysics scheme; CPU-based single-threaded counterpart; CUDA C; Fortran code; GPU; NVIDIA GTX 590; Stony Brook University scheme; WRF model; atmospheric research analysis; bulk water microphysics schemes; climate prediction; computational domain; data parallel execution; efficient graphics processing unit; horizontal grid points; hydrometeor classes; mixed-phase processes; next-generation mesoscale numerical weather prediction system; precipitation process; riming intensity; standard C code; water vapor process; weather prediction; weather research and forecasting model; Clouds; Computational modeling; Educational institutions; Graphics processing unit; Instruction sets; Meteorology; Parallel processing; Compute Unified Device Architecture (CUDA); Graphics Processing Unit (GPU); Stony Brook University 5-class cloud microphysics scheme; Weather Research and Forecasting (WRF) model;
Journal_Title :
Selected Topics in Applied Earth Observations and Remote Sensing, IEEE Journal of
DOI :
10.1109/JSTARS.2011.2175707