Title :
Low-overhead protocols for fault-tolerant file sharing
Author :
Alvisi, Lorenzo ; Rao, Sriram ; Vin, Harrick M.
Author_Institution :
Dept. of Comput. Sci., Texas Univ., Austin, TX, USA
Abstract :
We quantify the adverse effect of file sharing on the performance of reliable distributed applications. We demonstrate that file sharing incurs significant overhead, which is likely to triple over the next five years. We present a novel approach that eliminates this overhead. Our approach: tracks causal dependencies resulting from file sharing using determinants; efficiently replicates the determinants in the volatile memory of agents to ensure their availability during recovery; and reproduces during recovery the interactions with the file server as well as the file data lost in a failure. Our approach allows agents to exchange files directly without first saving the files on disks at the server. As a consequence, the costs of supporting file sharing and message passing in a reliable distributed application become virtually identical. The result is a simple, uniform approach, which can provide low-overhead fault tolerance to applications in which communication is performed through message passing, file sharing, or a combination of the two
Keywords :
file servers; message passing; protocols; software fault tolerance; software performance evaluation; system recovery; application performance; causal dependencies; fault-tolerant file sharing; file server; low overhead protocols; message passing; recovery; reliable distributed applications; software reliability; Access protocols; Checkpointing; Costs; Delay; Fault tolerance; File servers; Message passing; Peer to peer computing; Read only memory;
Conference_Titel :
Distributed Computing Systems, 1998. Proceedings. 18th International Conference on
Conference_Location :
Amsterdam
Print_ISBN :
0-8186-8292-2
DOI :
10.1109/ICDCS.1998.679774