Title :
Checkpoint and Run-Time Adaptation with Pluggable Parallelisation
Author :
Medeiros, Bruno ; Sobral, João L.
Author_Institution :
Dept. de Inf./CCTC, Univ. do Minho, Braga, Portugal
Abstract :
Enabling applications for computational Grids requires new development approaches that can effectively cope with resource volatility. Applications must be resilient to resource faults and adapt their behaviour to the available resources. This paper describes an approach to application-level adaptation that efficiently supports application-level checkpointing. The key concept of this work is pluggable parallelisation, which localises parallelisation issues into multiple modules that can be (un)plugged to match resource availability. This paper shows how pluggable parallelisation can be extended to effectively support checkpointing and run-time adaptation. We present the pluggable mechanisms we developed, which help the programmer include checkpointing in the base (sequential) code. Based on these mechanisms and on previous work on pluggable parallelisation, our approach automatically adds support for checkpointing in parallel execution environments. Moreover, applications can adapt from a sequential execution to a multi-cluster configuration. Adaptation can be performed either by checkpointing the application and restarting it in a different mode, or during run-time. Pluggable parallelisation intrinsically promotes the separation of software functionality from fault-tolerance and adaptation issues, facilitating their analysis and evolution. The work presented in this paper reinforces this idea by showing the feasibility of the approach and the performance benefits that can be achieved.
Keywords :
checkpointing; grid computing; parallel processing; software fault tolerance; application-level checkpointing; computational grids; fault-tolerance; multicluster configuration; pluggable parallelisation; run-time adaptation; software functionality; Aggregates; Fault tolerance; Fault tolerant systems; Instruction sets; Programming; aspect oriented programming
Conference_Title :
Parallel Processing (ICPP), 2011 International Conference on
Conference_Location :
Taipei City
Print_ISBN :
978-1-4577-1336-1
Electronic_ISBN :
0190-3918
DOI :
10.1109/ICPP.2011.83