DocumentCode :
505970
Title :
Advanced data flow support for scientific grid workflow applications
Author :
Qin, Jun ; Fahringer, Thomas
Author_Institution :
University of Innsbruck, Innsbruck, Austria
fYear :
2007
fDate :
10-16 Nov. 2007
Firstpage :
1
Lastpage :
12
Abstract :
Existing work does not provide a flexible dataset-oriented data flow mechanism to meet the complex requirements of scientific Grid workflow applications. In this paper we present a sophisticated approach to this problem by introducing a data collection concept and the corresponding collection distribution constructs, which are inspired by HPF, however applied to Grid workflow applications. Based on these constructs, more fine-grained data flows can be specified at an abstract workflow language level, such as mapping a portion of a dataset to an activity, independently distributing multiple datasets, not necessarily with the same number of data elements, onto loop iterations. Our approach reduces data duplication, optimizes data transfers as well as simplifies the effort to port workflow applications onto the Grid. We have extended AGWL with these concepts and implemented the corresponding runtime support in ASKALON. We apply our approach to some real world scientific workflow applications and report performance results.
Keywords :
Application software; Computer science; Control systems; Data engineering; Engineering management; Grid computing; Permission; Resource management; Runtime; Technology management; data collection; data distribution; data flow; grid workflow;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Supercomputing, 2007. SC '07. Proceedings of the 2007 ACM/IEEE Conference on
Conference_Location :
Reno, NV, USA
Print_ISBN :
978-1-59593-764-3
Electronic_ISBN :
978-1-59593-764-3
Type :
conf
DOI :
10.1145/1362622.1362679
Filename :
5348801
Link To Document :
بازگشت