DocumentCode :
3124773
Title :
RACE: a software-based fault tolerance scheme for systematically transforming ordinary algorithms to robust algorithms
Author :
Yeh, Chi-Hsiang ; Parhami, Behrooz ; Varavrigos, E.A. ; Varvarigou, Theodora A.
Author_Institution :
Dept. of Electr. & Comput. Eng., Queen´´s Univ., Kingston, Ont., Canada
fYear :
2001
fDate :
36982
Abstract :
We propose the robust algorithm-configured emulation (RACE) scheme for efficient parallel computation and communication in the presence of faults. A wide variety of algorithms originally designed for fault-free meshes, tori, and k-ary n-cubes can be transformed to corresponding robust algorithm through RACE. In particular optimal robust algorithms can be derived for total exchange (TE) and ascend/descend operations with a factor of 1+o (1) slowdown. Also, RACE can tolerate a large number of faulty elements, without relying on hardware redundancy or any assumption about the availability of a complete subarray
Keywords :
hypercube networks; parallel algorithms; software fault tolerance; sorting; RACE; ascend/descend operations; fault-free meshes; hardware redundancy; k-ary n-cubes; optimal robust algorithms; parallel computation; robust algorithm-configured emulation; robust algorithms; software-based fault tolerance scheme; tori; total exchange; Algorithm design and analysis; Concurrent computing; Emulation; Fault tolerant systems; Hardware; Redundancy; Robustness; Scalability; Sorting; Tellurium;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium., Proceedings 15th International
Conference_Location :
San Francisco, CA
ISSN :
1530-2075
Print_ISBN :
0-7695-0990-8
Type :
conf
DOI :
10.1109/IPDPS.2001.924968
Filename :
924968
Link To Document :
بازگشت