Title :
On efficiently tolerating general failures in autonomous decentralized multiserver systems
Author :
Yen, I-Ling ; Bastani, Farokh B.
Author_Institution :
Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Abstract :
We consider a multiserver system consisting of a set of servers that provide some service to a set of clients by accessing some shared objects. The goal is to provide reliable service in spite of client or server failures such that the overhead during normal operating periods is low. We consider a relatively general fault model where a faulty processor can write spurious data for a period of time before it is detected and removed from the system. We first develop a solution in an autonomous physical world of clients and servers. The basic approach is to divide the servers into groups such that each server has some limitations which prevent it from arbitrarily damaging the system. This solution is then mapped to a distributed system of processor and memory units followed by an assessment of its performance
Keywords :
client-server systems; computer network reliability; software fault tolerance; autonomous decentralized multiserver systems; autonomous physical world; client failures; clients; damage prevention; distributed system; efficient general failure tolerance; faulty processor; general fault model; memory units; normal operating period overhead; performance assessment; reliable service; server failures; shared objects; spurious data writing; Computer science; Concurrent computing; Costs; Degradation; Delay; Distributed computing; Distributed processing; Fault detection; Fault tolerant systems; File servers; High performance computing;
Conference_Titel :
Autonomous Decentralized Systems, 1995. Proceedings. ISADS 95., Second International Symposium on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-8186-7087-8
DOI :
10.1109/ISADS.1995.398986