• DocumentCode
    1913270
  • Title

    A General Approach to Real-Time Workflow Monitoring

  • Author

    Vahi, Karan ; Harvey, Ian ; Samak, Taghrid ; Gunter, Dan ; Evans, Kim ; Rogers, D. ; Taylor, Ian ; Goode, Monte ; Silva, Francisco ; Al-Shakarchi, Eddie ; Mehta, Garima ; Jones, Andrew ; Deelman, Ewa

  • Author_Institution
    USC Inf. Sci. Inst., Marina Del Rey, CA, USA
  • fYear
    2012
  • fDate
    10-16 Nov. 2012
  • Firstpage
    108
  • Lastpage
    118
  • Abstract
    Scientific workflow systems support different workflow representations, operational modes and configurations. However, independent of the system used, end users need to track the status of their workflows in real time, be notified of execution anomalies and failures automatically, perform troubleshooting and automate the analysis of the workflow to help categorize and qualify the results. In this paper, we describe how the Stampede monitoring infrastructure, which was previously integrated in the Pegasus Workflow Management System, was employed in Triana in order to add generic real time monitoring and troubleshooting capabilities across both systems. Stampede is an infrastructure that attempts to address interoperable monitoring needs by providing a three-layer model: a common data model to describe workflow and job executions; high-performance tools to load workflow logs conforming to the data model into a data store, and a querying interface for extracting information from the data store in a standard fashion. The resulting integration demonstrates the generic nature of the Stampede monitoring infrastructure that has the potential to provide a common platform for monitoring across scientific workflow engines.
  • Keywords
    data handling; open systems; real-time systems; system recovery; workflow management software; Pegasus workflow management system; data model; general approach; interoperable monitoring; operational configurations; operational modes; querying interface; real-time workflow monitoring; scientific workflow engines; scientific workflow systems; stampede monitoring infrastructure; workflow representations; interoperability; log analysis; monitoring; scientific workflows; workflow performance statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
  • Conference_Location
    Salt Lake City, UT
  • Print_ISBN
    978-1-4673-6218-4
  • Type

    conf

  • DOI
    10.1109/SC.Companion.2012.26
  • Filename
    6495808