Title :
Solving Large Nonlinear Systems of First-Order Ordinary Differential Equations With Hierarchical Structure Using Multi-GPGPUs and an Adaptive Runge Kutta ODE Solver
Author :
AL-Omari, Ahmad ; Arnold, Jonathan ; Taha, Thiab ; Schuttler, Heinz-Bernd
Author_Institution :
Inst. of Bioinf., Univ. of Georgia, Athens, GA, USA
Abstract :
The adaptive Runge-Kutta (ARK) method on multi-general-purpose graphical processing units (GPUs) is used for solving large nonlinear systems of first-order ordinary differential equations (ODEs) with over ~ 10 000 variables describing a large genetic network in systems biology for the biological clock. To carry out the computation of the trajectory of the system, a hierarchical structure of the ODEs is exploited, and an ARK solver is implemented in compute unified device architecture/C++ (CUDA/C++) on GPUs. The result is a 75-fold speedup for calculations of 2436 independent modules within the genetic network describing clock function relative to a comparable CPU architecture. These 2436 modules span one-quarter of the entire genome of a model fungal system, Neurospora crassa. The power of a GPU can in principle be harnessed by using warp-level parallelism, instruction level parallelism or both of them. Since the ARK ODE solver is entirely sequential, we propose a new parallel processing algorithm using warp-level parallelism for solving ~ 10 000 ODEs that belong to a large genetic network describing clock genome-level dynamics. A video is attached illustrating the general idea of the method on GPUs that can be used to provide new insights into the biological clock through single cell measurements on the clock.
Keywords :
Runge-Kutta methods; bioinformatics; differential equations; graphics processing units; mathematics computing; nonlinear equations; nonlinear systems; parallel algorithms; parallel architectures; ARK method; ARK solver; CPU architecture; CUDA-C++; adaptive runge Kutta ODE solver; biological clock function; clock genome-level dynamics; compute unified device architecture; first-order ordinary differential equations; hierarchical structure; instruction level parallelism; large genetic network; large nonlinear systems; model fungal system; multiGPGPUs; multigeneral-purpose graphical processing units; neurospora crassa; parallel processing algorithm; single cell measurements; systems biology; warp-level parallelism; Bioinformatics; Clocks; Genomics; Graphics processing units; Instruction sets; Parallel processing; Bioinformatics; adaptive Runge–Kutta integration; biological clock; finite element method; general-purpose graphical processing unit; ordinary differential equation; systems biology; warp-level parallelism;
Journal_Title :
Access, IEEE
DOI :
10.1109/ACCESS.2013.2290623