Title :
Optimal fault-tolerant resource allocation in dynamic distributed systems
Author_Institution :
Dept. of Comput. Sci., Warwick Univ., Coventry
Abstract :
This paper presents a fault-tolerant resource allocation algorithm in a dynamic distributed message passing system, where concurrent processes sharing system resources can be created or terminated dynamically. The degree of fault-tolerance is measured by the failure locality that is the maximum number of processes whose liveness conditions (e.g., starvation freedom) cannot be satisfied because of a single process failure. The algorithm guarantees the optimal failure locality
Keywords :
distributed processing; fault tolerant computing; message passing; resource allocation; concurrent processes; dynamic distributed message passing system; dynamic distributed systems; failure locality; liveness conditions; optimal fault-tolerant resource allocation; starvation freedom; system resources; Aging; Computer crashes; Computer science; Database systems; Fault tolerance; Fault tolerant systems; Heuristic algorithms; Message passing; Resource management; System recovery;
Conference_Titel :
Parallel and Distributed Processing, 1995. Proceedings. Seventh IEEE Symposium on
Conference_Location :
San Antonio, TX
Print_ISBN :
0-81867195-5
DOI :
10.1109/SPDP.1995.530719