Title :
Reliability analysis in distributed systems
Author :
Raghavendra, C.S. ; Kumar, V. K. Prasanna ; Hariri, Salim
Author_Institution :
Dept. of Electr. Eng.-Syst., Univ. of Southern California, Los Angeles, CA, USA
Date :
3/1/1988
Abstract :
Reliability of a distributed processing system is an important design parameter that can be described in terms of the reliability of processing elements and communication links and also of the redundancy of programs and data files. The traditional terminal-pair reliability does not capture the redundancy of programs and files in a distributed system. Two reliability measures are introduced: distributed program reliability, which describes the probability of successful execution of a program requiring cooperation of several computers, and distributed system reliability, which is the probability that all the specified distributed programs for the system are operational. These two reliability measures can be extended to incorporate the effects of user sites on reliability. An efficient approach based on graph traversal is developed to evaluate the proposed reliability measures.
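The two measures defined in the abstract can be illustrated with a minimal sketch. It is not the graph-traversal method of the paper: it uses brute-force enumeration of link states, which is only practical for very small networks. It also assumes that nodes never fail and that links fail independently, and it reads "successful execution" as "some copy of the program can reach at least one copy of every file it needs". All names (`link_rel`, `file_sites`, `program_reliability`, `system_reliability`) are hypothetical.

```python
from itertools import product


def reachable(start, up_edges):
    """Return the set of nodes connected to `start` over working links."""
    seen, stack = {start}, [start]
    while stack:
        node = stack.pop()
        for a, b in up_edges:
            # Links are undirected: follow the edge from either endpoint.
            nxt = b if a == node else a if b == node else None
            if nxt is not None and nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def program_runs(program_sites, needed_files, up_edges, file_sites):
    """True if some program copy can reach a copy of every needed file."""
    return any(
        all(reachable(site, up_edges) & file_sites[f] for f in needed_files)
        for site in program_sites
    )


def system_reliability(link_rel, programs, file_sites):
    """Probability that ALL programs run (assumed reading of the abstract).

    link_rel:   {(u, v): probability that the link is working}
    programs:   list of (program_sites, needed_files) pairs
    file_sites: {file_name: set of nodes holding a copy}
    """
    edges = list(link_rel)
    total = 0.0
    # Enumerate every up/down combination of links (2**len(edges) states).
    for states in product([True, False], repeat=len(edges)):
        prob = 1.0
        for edge, up in zip(edges, states):
            prob *= link_rel[edge] if up else 1.0 - link_rel[edge]
        up_edges = [edge for edge, up in zip(edges, states) if up]
        if all(program_runs(s, f, up_edges, file_sites) for s, f in programs):
            total += prob
    return total


def program_reliability(link_rel, program_sites, needed_files, file_sites):
    """Reliability of a single distributed program (one-program system)."""
    return system_reliability(
        link_rel, [(program_sites, needed_files)], file_sites
    )
```

For a hypothetical three-node ring with link reliability 0.9 on every link, a program at node 1 that needs a file stored only at node 3 has reliability of approximately 0.981. Placing a second copy of the file at node 2 raises this value, which is the redundancy effect that terminal-pair reliability does not capture.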
Keywords :
distributed processing; fault tolerant computing; communication links; data files; design parameter; distributed program reliability; distributed systems; graph traversal; redundancy; Computer network reliability; Distributed computing; Intelligent networks; Load management; Resource management; Surface-mount technology; Telecommunication network reliability; Tree graphs
Journal_Title :
Computers, IEEE Transactions on