Title :
Group communication protocol for flexible distributed systems
Author :
Higaki, Hiroaki ; Takizawa, Makoto
Author_Institution :
Tokyo Denki Univ., Saitama, Japan
Date :
29 Oct-1 Nov 1996
Abstract :
In large-scale distributed systems, processes have to be upgraded to absorb changes in user requirements and system environments. Conventional upgrading methods cannot keep the system available because multiple processes have to be suspended simultaneously. This paper discusses a new method in which each process can invoke the upgrading procedure asynchronously. The key idea is that multiple versions of processes can be in operation temporarily. Each pair of an old-version process and a new-version process is managed as one process group. The proposed group communication protocol supports message transmission among the process groups. Moreover, the protocol detects protocol errors caused by the coexistence of multiple versions of processes. A checkpoint-rollback algorithm for resolving the protocol errors is proposed. With this algorithm, the minimum number of processes is rolled back asynchronously. Hence, the system remains highly available even if a protocol error occurs.
Keywords :
distributed processing; error detection; groupware; message switching; protocols; checkpoint-rollback algorithm; computer networks; distributed programs; dynamic upgrading method; flexible distributed systems; group communication protocol; large-scale distributed systems; message transmission; process groups; protocol error detection; system environments; Communication networks; Computer errors; Computer networks; Costs; Distributed computing; Information systems; Large-scale systems; Protocols; System recovery; Testing
Conference_Title :
Network Protocols, 1996. Proceedings., 1996 International Conference on
Conference_Location :
Columbus, OH
Print_ISBN :
0-8186-7453-9
DOI :
10.1109/ICNP.1996.564901