DocumentCode :
2978333
Title :
An OS-level Framework for Providing Application-Aware Reliability
Author :
Wang, Long ; Kalbarczyk, Zbigniew ; Gu, Weining ; Iyer, Ravishankar K.
Author_Institution :
Center for Reliable & High Performance Comput., Illinois Univ., Urbana, IL
fYear :
2006
fDate :
Dec. 2006
Firstpage :
55
Lastpage :
62
Abstract :
The paper describes the reliability microkernel framework (RMK), a loadable kernel module for providing application-aware reliability and dynamically configuring reliability mechanisms installed in RMK. The RMK prototype is implemented in Linux and supports detection of application/OS failures and transparent application checkpointing. Experiment results show that the OS hang detection, which exploits characteristics of application and system behavior, can achieve high coverage (100% in our experiments) and low false positive rate. Moreover, the performance overhead is negligible because instruction counting is performed in hardware
Keywords :
checkpointing; operating system kernels; software reliability; Linux; OS-level framework; application-aware reliability; checkpointing; loadable kernel module; reliability microkernel framework; system behavior; Application software; Checkpointing; Computer architecture; Fault detection; Hardware; Kernel; Linux; Monitoring; Operating systems; Pins;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable Computing, 2006. PRDC '06. 12th Pacific Rim International Symposium on
Conference_Location :
Riverside, CA
Print_ISBN :
0-7695-2724-8
Type :
conf
DOI :
10.1109/PRDC.2006.19
Filename :
4041888
Link To Document :
بازگشت