• DocumentCode
    3646922
  • Title

    Automating Data-Throttling Analysis for Data-Intensive Workflows

  • Author

    R. J. Rodríguez;R. Tolosana-Calasanz;O. F. Rana

  • Author_Institution
    Dipt. de Inf. e Ing. de Sist., Univ. de Zaragoza, Zaragoza, Spain
  • fYear
    2012
  • fDate
    5/1/2012 12:00:00 AM
  • Firstpage
    310
  • Lastpage
    317
  • Abstract
    Data movement between tasks in scientific workflows has received limited attention compared to task execution. Often the staging of data between tasks is either assumed or the time delay in data transfer is considered to be negligible (compared to task execution). Where data consists of files, such file transfers are accomplished as fast as the network links allow, and once transferred, the files are buffered/stored at their destination. Where a task requires multiple files to execute (from different tasks), it must, however, remain idle until all files are available. Hence, network bandwidth and buffer/storage within a workflow are often not used effectively. We propose an automated workflow structural analysis method for Directed Acyclic Graphs (DAGs) which utilises information from previous workflow executions. The method obtains data-throttling values for the data transfer to enable network bandwidth and buffer/storage capacity to be managed more efficiently. We convert a DAG representation into a Petri net model and analyse the resulting graph using an iterative method to compute data-throttling values. Our approach is demonstrated using the Montage workflow.
  • Keywords
    "Bandwidth","Buffer storage","Computational modeling","Network topology","Data models","Petri nets","Delay"
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2012 12th IEEE/ACM International Symposium on
  • Print_ISBN
    978-1-4673-1395-7
  • Type

    conf

  • DOI
    10.1109/CCGrid.2012.27
  • Filename
    6217436