• DocumentCode
    238958
  • Title

    A Cleanup Algorithm for Implementing Storage Constraints in Scientific Workflow Executions

  • Author

    SRINIVASAN, SUDARSHAN ; Juve, Gideon ; Da Silva, Rafael Ferreira ; Vahi, Karan ; Deelman, Ewa

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Hyderabad, Hyderabad, India
  • fYear
    2014
  • fDate
    16-16 Nov. 2014
  • Firstpage
    41
  • Lastpage
    49
  • Abstract
    Scientific workflows are often used to automate large-scale data analysis pipelines on clusters, grids, and clouds. However, because workflows can be extremely data-intensive, and are often executed on shared resources, it is critical to be able to limit or minimize the amount of disk space that workflows use on shared storage systems. This paper proposes a novel and simple approach that constrains the amount of storage space used by a workflow by inserting data cleanup tasks into the workflow task graph. Unlike previous solutions, the proposed approach provides guaranteed limits on disk usage, requires no new functionality in the underlying workflow scheduler, and does not require estimates of task runtimes. Experimental results show that this algorithm significantly reduces the number of cleanup tasks added to a workflow and yields better workflow makespans than the strategy currently used by the Pegasus Workflow Management System.
  • Keywords
    cloud computing; data analysis; graph theory; grid computing; natural sciences computing; storage management; workflow management software; Pegasus workflow management system; cleanup algorithm; disk space amount minimization; large-scale data analysis automation; scientific workflow executions; storage constraints; workflow task graph; Clustering algorithms; Electronic mail; Parallel processing; Partitioning algorithms; Pipelines; Planning; Runtime;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Workflows in Support of Large-Scale Science (WORKS), 2014 9th Workshop on
  • Conference_Location
    New Orleans, LA
  • Type

    conf

  • DOI
    10.1109/WORKS.2014.8
  • Filename
    7019861