Title :
The Extended Finite State Machine and Fault Tolerant Mechanism in Distributed Systems
Author :
Gao, Shengfa ; Li, Xin ; Zhang, Ruihua
Author_Institution :
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
Abstract :
Synchronization and fault tolerance of processes are emphasis in the distributed systems research, but only a few people involves in the mathematics model used in processes synchronization and fault tolerance yet. This paper takes distributed system as an event driven system, classify the events that cause system state variety into four classes, and proposed an extended finite state machine (EFSM) with synchronization and fault tolerant message to the distributed system. Accordingly, a checkpoint set up algorithm based in this EFSM is proposed. During the establishing of the checkpoint, the consistency of checkpoint can be determined by calculating the number of sending and receiving messages. In case of lost message, sending and receiving process that lost message can be found by checking the number of sending and receiving messages, and the lost messages can be retransmitted and received. Thus the establishing of the distributed systems global state has been simplified.
Keywords :
checkpointing; distributed processing; fault tolerant computing; finite state machines; synchronisation; checkpoint set up algorithm; distributed system; event classification; event driven system; extended finite state machine; fault tolerant message; mathematics model; synchronization; Automata; Clocks; Computer aided instruction; Computer science; Conference management; Fault tolerant systems; Mathematical model; Mathematics; Software engineering; Synchronization; Events classification; Extended Finite State Machine; checkpoint;
Conference_Titel :
Software Engineering Research, Management and Applications, 2009. SERA '09. 7th ACIS International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-3903-4
DOI :
10.1109/SERA.2009.33