Title :
Efficient treatment of failures in RPC systems
Author :
Issarny, Valérie ; Muller, Gilles ; Puaut, Isabelle
Author_Institution :
INRIA, IRISA, Rennes, France
Abstract :
This paper addresses extensions to be made to a basic remote procedure call system for the integration of primitive fault tolerance measures. Our main design goal is to not introduce performance penalty for remote procedure calls executing in the absence of failures, and to not impose significant overhead by the treatment of failures. Basically, extensions include a simple algorithm that finds and eliminates orphans, and a mechanism that detects abnormally terminated remote calls. Our solution for orphan detection as based on the extermination approach, its efficiency coming from a minor addition to the system architecture that allows the implementation of high speed stable storage. Performance measures given by the implementation of our reliability mechanisms on top of the Mach 3.0/BSD UX36 operating system show that the mechanisms are responsible for adding only 1% overhead on the operating system´s base remote procedure call
Keywords :
distributed processing; operating systems (computers); system recovery; RPC systems; failures; fault tolerance; operating system; orphan detection; primitive fault tolerance measures; remote procedure call; Computer crashes; Fault tolerant systems; Operating systems; Out of order; Propagation losses; Terminology; Yarn;
Conference_Titel :
Reliable Distributed Systems, 1994. Proceedings., 13th Symposium on
Conference_Location :
Dana Point, CA
Print_ISBN :
0-8186-6575-0
DOI :
10.1109/RELDIS.1994.336898