DocumentCode :
1898283
Title :
NV-TS: A Fault Tolerance Transaction System Based on Persistent Memory
Author :
Li, Xu ; Lu, Kai ; Zhou, Xu
Author_Institution :
Nat. Univ. of Defense Technol., Changsha, China
Volume :
2
fYear :
2012
fDate :
23-25 March 2012
Firstpage :
221
Lastpage :
224
Abstract :
The scalability of future high performance computing (HPC) systems are challenged by high failure rates. So fault tolerance technique will play a more important role in future HPC field. Currently, the checkpoint-restart technique is the main fault tolerance technique. However, the checkpoint-restart approach results in a very high overhead, which influences the efficiency of HPC systems seriously. In this paper, we leverage the emerging NVRAM technology and propose to combine transaction and NVRAM technique to design a new fault tolerance technique. We present NV-TS, a fault tolerance transaction system based on NVRAM. NV-TS guarantee that the update of application state is atomic and durable. If the system crashes suddenly during the application execution, the atomicity of transaction will ensure the consistency of application state. After the system restarts, the application could continue to run. Our experiment shows that NV-TS could improve the performance of fault tolerance with a small memory overhead.
Keywords :
checkpointing; fault tolerant computing; random-access storage; storage management; HPC system; NV-TS; NVRAM technology; checkpoint-restart technique; failure rate; fault tolerance technique; fault tolerance transaction system; high performance computing; memory overhead; persistent memory; transaction atomicity; Fault tolerance; Fault tolerant systems; Memory management; Nonvolatile memory; Phase change random access memory; USA Councils; Fault tolerance; Performance; Persistent memor; Transaction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Electronics Engineering (ICCSEE), 2012 International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-0689-8
Type :
conf
DOI :
10.1109/ICCSEE.2012.274
Filename :
6188006
Link To Document :
بازگشت