Title :
Towards constructing application-level GPU computation states
Author :
Yulu Zhang ; Xinyuan Guo ; Hai Jiang ; Kuan-Ching Li
Author_Institution :
Dept. of Comput. Sci., Arkansas State Univ., Jonesboro, AR, USA
Abstract :
Computation state construction is an indispensable step to achieve fault tolerance and computation mobility for scientific applications by saving and restoring the state during program execution. However, there is no effective state construction scheme yet due to the GPU´s batch-mode execution manner as the GPU takes on a larger role in high performance computing. The GPU´s complex memory hierarchy means the states are scattered in different memory locations that are difficult to fetch. Programs that are running in parallel make the states difficult to construct for each thread. The paper proposes an application-level computation state construction scheme to support GPU programs. A precompiler and run-time support module are developed to construct and save states in the CPU system memory dynamically. Memory blocks are registered, and new data structures are proposed to save and restore the computation states represented by variables and pointers in the GPU. Secondary storage can be utilized for scalability and long-term fault tolerance.
Keywords :
application program interfaces; data structures; fault tolerance; graphics processing units; parallel memories; parallel programming; program compilers; storage management; CPU system memory; GPU; application level computation state construction; complex memory hierarchy; computation state restoration; computation state saving; data structures; fault tolerance; high performance computing; memory block; mobility computation; precompiler; program execution; run time support module; scalability; scientific application; Data structures; Graphics processing units; Instruction sets; Kernel; Libraries; Radiation detectors; Registers; CUDA; GPU; state construction;
Conference_Titel :
Computer and Information Science (ICIS), 2013 IEEE/ACIS 12th International Conference on
Conference_Location :
Niigata
DOI :
10.1109/ICIS.2013.6607834