DocumentCode :
292320
Title :
Implementing halt on failure processors
Author :
Macdonald, R.N. ; Shoja, G.C.
Author_Institution :
Dept. of Comput. Sci., Victoria Univ., BC, Canada
Volume :
1
fYear :
1993
fDate :
19-21 May 1993
Firstpage :
272
Abstract :
The problem of detecting and masking failed processes in a distributed processing environment is considered. The authors propose a virtual halt on failure processor where replicated processes are used to achieve fault tolerance. Processor failures are detected and masked up to a certain limit. Once the threshold of permissible node failures is exceeded, the virtual processor reports the failure and halts. The authors contend that this is more practical and efficient than the generally assumed fail-stop processor. Results of an implementation in the REM (Remote Execution Manager) environment are presented
Keywords :
distributed processing; fault tolerant computing; virtual machines; Remote Execution Manager; distributed processing; fault tolerance; halt on failure processors; processor failure masking; replicated processes; virtual processor; Computer errors; Computer science; Distributed processing; Fault detection; Fault tolerance; Fault tolerant systems; Scholarships; Time factors; Timing; Workstations;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications, Computers and Signal Processing, 1993., IEEE Pacific Rim Conference on
Conference_Location :
Victoria, BC
Print_ISBN :
0-7803-0971-5
Type :
conf
DOI :
10.1109/PACRIM.1993.407171
Filename :
407171
Link To Document :
بازگشت