DocumentCode
2243983
Title
A Distributed Workflow Mapping Algorithm for Minimum End-to-End Delay under Fault-Tolerance Constraint
Author
Wu, Qishi ; Gu, Yi
Author_Institution
Dept. of Comput. Sci., Univ. of Memphis, Memphis, TN, USA
fYear
2010
fDate
8-10 Dec. 2010
Firstpage
508
Lastpage
515
Abstract
Many large-scale scientific applications feature distributed computing workflows of complex structures that must be executed and transferred in shared wide-area networks consisting of unreliable nodes and links. Mapping these computing workflows in such faulty network environments for optimal latency while ensuring certain fault tolerance is crucial to the success of eScience that requires both performance and reliability. We construct analytical cost models and formulate workflow mapping as an optimization problem under failure rate constraint. We propose a distributed heuristic mapping solution based on recursive critical path to achieve minimum end-to-end delay and satisfy a pre-specified overall failure rate for a guaranteed level of fault tolerance. The performance superiority of the proposed mapping solution is illustrated by extensive simulation-based comparisons with existing mapping algorithms.
Keywords
distributed processing; fault tolerant computing; optimisation; software reliability; complex structures; distributed computing workflows; distributed heuristic mapping solution; distributed workflow mapping algorithm; eScience; fault-tolerance constraint; faulty network environments; large-scale scientific applications; minimum end-to-end delay; optimization problem; shared wide-area networks; distributed algorithm; end-to-end delay; fault tolerance; scientific workflow;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Systems (ICPADS), 2010 IEEE 16th International Conference on
Conference_Location
Shanghai
ISSN
1521-9097
Print_ISBN
978-1-4244-9727-0
Electronic_ISBN
1521-9097
Type
conf
DOI
10.1109/ICPADS.2010.38
Filename
5695642
Link To Document