DocumentCode :
2663050
Title :
An Efficient and Reliable Scientific Workflow System
Author :
Tavares, Thiago ; Teodoro, George ; Kurc, Tahsin ; Ferreira, Ricardo ; Guedes, Dorgival ; Meira, Wagner ; Catalyurek, Umit
Author_Institution :
Dept. of Comput. Sci., Univ. Fed. de Minas Gerais, Belo Horizonte
fYear :
2007
fDate :
14-17 May 2007
Firstpage :
445
Lastpage :
452
Abstract :
This paper presents a fault tolerance framework for applications that process data using a distributed network of user-defined operations in a pipelined fashion. The framework saves intermediate results and messages exchanged among application components in a distributed data management system to facilitate quick recovery from failures. The experimental results show that the framework scales well and our approach introduces very little overhead to application execution.
Keywords :
middleware; natural sciences computing; software fault tolerance; system recovery; workflow management software; distributed data management system; distributed network; fault tolerance; middleware; scientific workflow system; user-defined operations; Biomedical computing; Biomedical informatics; Computer networks; Data analysis; Data processing; Distributed computing; Fault tolerance; Fault tolerant systems; Middleware; Protocols;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing and the Grid, 2007. CCGRID 2007. Seventh IEEE International Symposium on
Conference_Location :
Rio De Janeiro
Print_ISBN :
0-7695-2833-3
Type :
conf
DOI :
10.1109/CCGRID.2007.20
Filename :
4215410
Link To Document :
بازگشت