Title :
Failure-Resilient Computations in the EcliPSe System
Author :
Knop, Felipe ; Rego, Vernon ; Sunderam, Vaidy ; Ferrari, Adam
Abstract :
Local or wide-area connected workstation cluster-based computation systems are inherently failure-prone, particularly for long running computations. In this work we introduce a variety of features for failure resilience in the EcliPSe system for replicative applications. Key characteristics of fault-tolerant EcliPSe are ease of use, low statesaving costs, system scalability and good performance.
Conference_Titel :
Parallel Processing, 1994. ICPP 1994 Volume 3. International Conference on
Conference_Location :
North Carolina, USA
Print_ISBN :
0-8493-2493-9
DOI :
10.1109/ICPP.1994.111