Title :
Persistent Locality Management of Scientific Application Workflows
Author :
Aouad, Lamine ; Kechadi, Tahar ; Petiton, Serge
Author_Institution :
Centre for Next Generation Localisation, Univ. of Limerick, Limerick, Ireland
Abstract :
The huge data requirements of today's large science and engineering applications make optimised, scalable data placement mechanisms essential. To this end, we propose a scheduling scheme based on efficient data locality management for data-intensive workflows. Transfer and placement decisions are based on constructs in the workflow that represent the inter-relationships between inputs and outputs at its different levels. When running large applications, most of the input data is not shipped; it stays close to the jobs, resulting in much less communication and transfer overhead. We have implemented these techniques for the YML workflow system. This paper presents results showing a substantial improvement in the performance of many interdependent multi-level workflows through these data placement optimisations.
Keywords :
database management systems; grid computing; YML workflow system; data transfer; persistent locality management; scalable data placement mechanisms; scheduling; scientific application workflows; Computer architecture; Distributed databases; Labeling; Libraries; Middleware; Processor scheduling; XML; Persistent data locality; Scientific computing; Workflow
Conference_Title :
Computational Science and Engineering (CSE), 2010 IEEE 13th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-9591-7
Electronic_ISBN :
978-0-7695-4323-9
DOI :
10.1109/CSE.2010.60