Title :
Automatic service availability management
Author :
Cristian, Flaviu
Author_Institution :
Dept. of Comput. Sci. & Eng., California Univ., La Jolla, CA, USA
Abstract :
A new kind of distributed system service called availability management service is introduced. It is responsible for ensuring that the critical services of a distributed system remain continuously available to users despite arbitrary numbers of concurrent node removals and node restarts caused by failures, maintenance, and growth. The description of many details involved in a realistic design is sacrificed to make the underlying concepts easily understandable. To this end, the availability management service is designed on top of an easy-to-understand synchronous communication environment, and only one kind of service availability policy is considered. It is indicated how the initial specification and design can be extended to deal with asynchronous systems subject to partitioning as well as with other kinds of service availability policies
Keywords :
distributed processing; fault tolerant computing; resource allocation; availability management; concurrent node removals; critical services; distributed system service; failures; growth; maintenance; node restarts; service availability policy; synchronous communication environment; Air traffic control; Availability; Computer science; Concurrent computing; Distributed computing; Engineering management; Humans; Object oriented programming; Paper technology; Silver;
Conference_Titel :
Autonomous Decentralized Systems, 1993. Proceedings. ISADS 93., International Symposium on
Conference_Location :
Kawasaki
Print_ISBN :
0-8186-3125-2
DOI :
10.1109/ISADS.1993.262682