Title :
Supporting distributed application management in Sampa
Author :
Endler, Markus ; Souza, Anil J D
Author_Institution :
Inst. de Matematica e Estatistica, Sao Paulo Univ., Brazil
Abstract :
The paper presents the architecture and base services of Sampa, a System for Availability Management of Process-based Applications. The system has been designed to support the management of fault-tolerant DCE-based distributed programs according to user provided and application-specific availability specifications. Sampa is supposed to detect and automatically react to faults such as node crashes, network partitions, process crashes and hang-ups. We focus on the design of its base services-the monitoring, reliable group communication and checkpointing facilities and show how they can be used for managing a generic replicated service.
Keywords :
distributed processing; formal specification; software fault tolerance; system monitoring; system recovery; systems software; Sampa; Sampa architecture; Sampa base services; application-specific availability specifications; availability management; checkpointing facilities; distributed application management support; fault-tolerant DCE-based distributed programs; generic replicated service; hang-ups; monitoring; network partitions; node crashes; process crashes; process-based applications; reliable group communication; user provided availability specifications; Availability; Checkpointing; Computer architecture; Computer crashes; Data analysis; Electronic mail; Fault detection; Fault tolerant systems; Monitoring; Project management;
Conference_Titel :
Configurable Distributed Systems, 1996. Proceedings., Third International Conference on
Conference_Location :
Annapolis, MD, USA
Print_ISBN :
0-8186-7395-8
DOI :
10.1109/CDS.1996.509360