DocumentCode :
3307152
Title :
ReViveI/O: efficient handling of I/O in highly-available rollback-recovery servers
Author :
Nakano, Jun ; Montesinos, Pablo ; Gharachorloo, Kourosh ; Torrellas, Josep
Author_Institution :
Illinois Univ., Champaign, IL, USA
fYear :
2006
fDate :
11-15 Feb. 2006
Firstpage :
200
Lastpage :
211
Abstract :
The increasing demand for reliable computers has led to proposals for hardware-assisted rollback of memory state. Such approach promises major reductions in mean time to repair (MTTR). The benefits are especially compelling for database servers, where existing recovery software typically leads to downtimes of tens of minutes. Unfortunately, adoption of such proposals is hindered by the lack of efficient mechanisms for I/O recovery. This paper presents and evaluates ReViveI/O, a scheme for I/O undo and redo that is compatible with mechanisms for hardware-assisted rollback of memory state. We have fully implemented a Linux-based prototype that shows that low-overhead, low-MTTR recovery of I/O is feasible. For 20-120 ms between checkpoints, a throughput-oriented workload such as TPC-C has negligible overhead. Moreover, for 50 ms or less between checkpoints, the response time of a latency-bound workload such as WebStone remains tolerable. In all cases, the recovery time of ReViveI/O is practically negligible. The result is a cost-effective highly-available server.
Keywords :
Linux; fault tolerant computing; shared memory systems; system recovery; I/O handling; Linux-based prototype; ReViveI/O; database server; hardware-assisted rollback-recovery server; mean time to repair; Application software; Checkpointing; Databases; Delay; Frequency; Hardware; Interleaved codes; Proposals; Prototypes; Web server;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High-Performance Computer Architecture, 2006. The Twelfth International Symposium on
ISSN :
1530-0897
Print_ISBN :
0-7803-9368-6
Type :
conf
DOI :
10.1109/HPCA.2006.1598129
Filename :
1598129
Link To Document :
بازگشت