DocumentCode :
3722585
Title :
Replacement: Decentralized Failure Handling for Replicated State Machines
Author :
Leander Jehl;Tormod Erevik Lea;Hein Meling
Author_Institution :
Univ. of Stavanger, Stavanger, Norway
fYear :
2015
Firstpage :
156
Lastpage :
165
Abstract :
We investigate methods for handling failures in a Paxos State Machine and introduce Replacement, a novel approach to handle failures. Replacement is fully decentralized and does not rely on consensus. This allows failed replicas to be replaced quickly, avoiding the bottleneck of a single leader. Instead of handling failures in the order proposed by a leader, concurrent replacements are combined to guarantee that all failed replicas are replaced. Replacement also allows the state machine to process client requests during failure handling, even while disagreeing on the current configuration. As our evaluation shows, this enables Replacement to quickly handle failures, with minimal disruption in the processing of client requests.
Keywords :
"Protocols","Fault tolerance","Fault tolerant systems","Delays","Synchronization","Buildings"
Publisher :
IEEE
Conference_Title :
Reliable Distributed Systems (SRDS), 2015 IEEE 34th Symposium on
Electronic_ISBN :
1060-9857
Type :
conf
DOI :
10.1109/SRDS.2015.29
Filename :
7371579