Title :
A Taxonomy for the Analysis of Scientific Workflow Faults
Author :
Lackovic, Marco ; Talia, Domenico ; Tolosana-Calasanz, Rafael ; Bañares, José A. ; Rana, Omer F.
Author_Institution :
Dept. of Electron., Comput. Sci. & Syst., Univ. of Calabria, Cosenza, Italy
Abstract :
Scientific workflows generally involve the distribution of tasks to distributed resources, which may exist in different administrative domains. The use of distributed resources in this way may lead to faults, and detecting them, identifying them and subsequently correcting them remains an important research challenge. We introduce a fault taxonomy for scientific workflows that may help in conducting a systematic analysis of faults, so that the potential faults that may arise at execution time can be corrected (recovered from). The presented taxonomy is motivated by previous work [4], but has a particular focus on workflow environments (compared to previous work which focused on Grid-based resource management) and demonstrated through its use in Weka4WS.
Keywords :
data mining; fault diagnosis; grid computing; resource allocation; software fault tolerance; workflow management software; Weka4WS; distributed resource; fault correction; fault detection; fault systematic analysis; fault taxonomy; scientific workflow faults analysis; Data mining; Fault detection; Fault tolerance; Fault tolerant systems; Middleware; Monitoring; Taxonomy; Fault Tolerance; Scientific Workflows;
Conference_Titel :
Computational Science and Engineering (CSE), 2010 IEEE 13th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-9591-7
Electronic_ISBN :
978-0-7695-4323-9
DOI :
10.1109/CSE.2010.59