DocumentCode
3307152
Title
ReViveI/O: efficient handling of I/O in highly-available rollback-recovery servers
Author
Nakano, Jun ; Montesinos, Pablo ; Gharachorloo, Kourosh ; Torrellas, Josep
Author_Institution
Illinois Univ., Champaign, IL, USA
fYear
2006
fDate
11-15 Feb. 2006
Firstpage
200
Lastpage
211
Abstract
The increasing demand for reliable computers has led to proposals for hardware-assisted rollback of memory state. Such approach promises major reductions in mean time to repair (MTTR). The benefits are especially compelling for database servers, where existing recovery software typically leads to downtimes of tens of minutes. Unfortunately, adoption of such proposals is hindered by the lack of efficient mechanisms for I/O recovery. This paper presents and evaluates ReViveI/O, a scheme for I/O undo and redo that is compatible with mechanisms for hardware-assisted rollback of memory state. We have fully implemented a Linux-based prototype that shows that low-overhead, low-MTTR recovery of I/O is feasible. For 20-120 ms between checkpoints, a throughput-oriented workload such as TPC-C has negligible overhead. Moreover, for 50 ms or less between checkpoints, the response time of a latency-bound workload such as WebStone remains tolerable. In all cases, the recovery time of ReViveI/O is practically negligible. The result is a cost-effective highly-available server.
Keywords
Linux; fault tolerant computing; shared memory systems; system recovery; I/O handling; Linux-based prototype; ReViveI/O; database server; hardware-assisted rollback-recovery server; mean time to repair; Application software; Checkpointing; Databases; Delay; Frequency; Hardware; Interleaved codes; Proposals; Prototypes; Web server;
fLanguage
English
Publisher
ieee
Conference_Titel
High-Performance Computer Architecture, 2006. The Twelfth International Symposium on
ISSN
1530-0897
Print_ISBN
0-7803-9368-6
Type
conf
DOI
10.1109/HPCA.2006.1598129
Filename
1598129
Link To Document