Title :
HG-ckpt: A tybrid granularity checkpoint frame
Author :
Li, Xu ; Lu, Kai ; Zhou, Xu
Author_Institution :
Nat. Univ. of Defense Technol., Changsha, China
Abstract :
Checkpoint-restart techniques are very important fault-tolerance approaches for these systems. As we know, the overhead of checkpoint will influence the system performance seriously. Incremental checkpoint is a well researched technique to reduce the overhead of checkpoint. However, huge memory footprints of parallel applications place severe limitations on scalability of conventional incremental checkpointing techniques. Recently, the emerging Phase Change Memory technology is becoming available and can replace disks with the fast accesses and byte-address features. In this paper, we propose and implement a new incremental checkpoint frame named HG-ckpt, which uses both page-level and byte-level granularity to detect dirty data. Our experiment results show that HG-ckpt can reduce checkpoint file size up to 50% without other overhead when compared to conventional page-level incremental checkpoint on PCM.
Keywords :
checkpointing; fault tolerant computing; phase change memories; HG-CKPT; PCM; byte-address features; byte-level granularity; checkpoint-restart techniques; fast access features; fault-tolerance approach; hybrid granularity checkpoint frame; incremental checkpointing techniques; page-level granularity; page-level incremental checkpoint; parallel application huge memory footprints; phase change memory technology; Benchmark testing; Checkpointing; Conferences; Fault tolerance; Fault tolerant systems; Phase change materials; Random access memory;
Conference_Titel :
Information Science and Technology (ICIST), 2012 International Conference on
Conference_Location :
Hubei
Print_ISBN :
978-1-4577-0343-0
DOI :
10.1109/ICIST.2012.6221697