DocumentCode
1913270
Title
A General Approach to Real-Time Workflow Monitoring
Author
Vahi, Karan ; Harvey, Ian ; Samak, Taghrid ; Gunter, Dan ; Evans, Kim ; Rogers, D. ; Taylor, Ian ; Goode, Monte ; Silva, Francisco ; Al-Shakarchi, Eddie ; Mehta, Garima ; Jones, Andrew ; Deelman, Ewa
Author_Institution
USC Inf. Sci. Inst., Marina Del Rey, CA, USA
fYear
2012
fDate
10-16 Nov. 2012
Firstpage
108
Lastpage
118
Abstract
Scientific workflow systems support different workflow representations, operational modes and configurations. However, independent of the system used, end users need to track the status of their workflows in real time, be notified of execution anomalies and failures automatically, perform troubleshooting and automate the analysis of the workflow to help categorize and qualify the results. In this paper, we describe how the Stampede monitoring infrastructure, which was previously integrated in the Pegasus Workflow Management System, was employed in Triana in order to add generic real time monitoring and troubleshooting capabilities across both systems. Stampede is an infrastructure that attempts to address interoperable monitoring needs by providing a three-layer model: a common data model to describe workflow and job executions; high-performance tools to load workflow logs conforming to the data model into a data store, and a querying interface for extracting information from the data store in a standard fashion. The resulting integration demonstrates the generic nature of the Stampede monitoring infrastructure that has the potential to provide a common platform for monitoring across scientific workflow engines.
Keywords
data handling; open systems; real-time systems; system recovery; workflow management software; Pegasus workflow management system; data model; general approach; interoperable monitoring; operational configurations; operational modes; querying interface; real-time workflow monitoring; scientific workflow engines; scientific workflow systems; stampede monitoring infrastructure; workflow representations; interoperability; log analysis; monitoring; scientific workflows; workflow performance statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
Conference_Location
Salt Lake City, UT
Print_ISBN
978-1-4673-6218-4
Type
conf
DOI
10.1109/SC.Companion.2012.26
Filename
6495808
Link To Document