Title :
GPU Accelerated Solver of Time-Dependent Air Pollutant Transport Equations
Author :
Simek, Vaclav ; Dvorak, Radim ; Zboril, Frantisek ; Drabek, Vladimir
Author_Institution :
Fac. of Inf. Technol., Brno Univ. of Technol., Brno, Czech Republic
Abstract :
Main objective of this paper is to outline possible ways how to achieve a substantial acceleration in case of advection-diffusion equation (A-DE) calculation, which is commonly used for a description of the pollutant behavior in atmosphere. A-DE is a land of partial differential equation (PDE) and in general case it is usually solved by numerical integration due to its high complexity. These types of calculations are time consuming thus the main idea of our work is to adopt compute unified device architecture (CUDA) software framework and commodity add-on card with general-purpose graphics processing unit (GPU) to do the calculations in a faster way. The solution is based on method of lines with 4 order Runge-Kutta scheme to handle the integration. As a matter of fact, the selected approach involves number of auxiliary variables and thus the memory management is critical in order to achieve desired performance. From a technical point of view, we have implemented a particular variant of the A-DE system, where the pollutant concentration is time-dependent. An efficient data handling is primarily based on the exploitation of shared memory blocks and texture caches inside GPU chip. Detailed evaluation of the obtained results is given in this paper where an astonishing execution speed up of GPU-based solution is demonstrated in comparison to standard CPU.
Keywords :
Runge-Kutta methods; air pollution; cache storage; computer graphic equipment; environmental factors; environmental science computing; integration; microprocessor chips; partial differential equations; storage management; GPU accelerated solver; GPU chip; Runge-Kutta scheme; advection-diffusion equation calculation; commodity add-on card; compute unified device architecture software framework; general-purpose graphics processing unit; memory management; numerical integration; partial differential equation; pollutant behavior; pollutant concentration; shared memory blocks; texture caches; time-dependent air pollutant transport equations; Acceleration; Air pollution; Computer architecture; Design methodology; Differential equations; Digital systems; Graphics; Hardware; Information technology; Partial differential equations; A-DE; CUDA; GPU; PDE; acceleration; parallel execution; pollutant transport equation;
Conference_Titel :
Digital System Design, Architectures, Methods and Tools, 2009. DSD '09. 12th Euromicro Conference on
Conference_Location :
Patras
Print_ISBN :
978-0-7695-3782-5
DOI :
10.1109/DSD.2009.146