DocumentCode :
235415
Title :
A checkpointing and instant-on mechanism for a embedded system based on non-volatile memories
Author :
Jianwen Sun ; Xiang Long ; Han Wan ; Jingwei Yang
Author_Institution :
Sch. of Comput. Sci. & Eng., Beihang Univ., Beijing, China
fYear :
2014
fDate :
20-22 Oct. 2014
Firstpage :
173
Lastpage :
178
Abstract :
Checkpointing is the act of saving the state of a running program so that it may be recovered later, which is a general idea that enables various functionalities in computer systems, including fault tolerance, system recovery, and process migration. Checkpointing mechanisms in traditional systems normally save the state of process running on volatile memory to a checkpoint file stored on non-volatile disks. However, moving checkpoint data between decoupled memory and storage levels can be significantly inefficient. Emerging non-volatile memory technology with their large, fast and persistent properties has the potential to finalize this traditional decoupled memory/storage model. Implementing checkpointing mechanism in a system based on non-volatile memories can largely improve the performance of checkpointing. We have implemented an embedded platform based on magnetic random access memories and a checkpointing mechanism in this system. Our results show that checkpointing performance based on non-volatile memories can be order of magnitudes faster than traditional disk-based (or flash-based) approaches and that it is possible to conduct checkpointing operation in a shorter period. Restarting procedure in our platform can be done instantaneously, which shows possibilities of implementing instant-on.
Keywords :
checkpointing; embedded systems; fault tolerant computing; magnetic storage; random-access storage; checkpoint file; checkpointing mechanism; computer systems; decoupled memory; embedded platform; embedded system; fault tolerance; instant-on mechanism; magnetic random access memories; nonvolatile disks; nonvolatile memories; process migration; restarting procedure; running program; system recovery; Checkpointing; Embedded systems; Kernel; Nonvolatile memory; Performance evaluation; Process control; Random access memory; Checkpoint; Fault Tolerance; Instant-on; Magnetic Random Access Memory; Persistent Memory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing, Communications and IT Applications Conference (ComComAp), 2014 IEEE
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-4813-0
Type :
conf
DOI :
10.1109/ComComAp.2014.7017191
Filename :
7017191
Link To Document :
بازگشت