• DocumentCode
    558698
  • Title
    Online workflow management and performance analysis with Stampede
  • Author
    Gunter, Dan ; Deelman, Ewa ; Samak, Taghrid ; Brooks, Christopher H. ; Goode, Monte ; Juve, Gideon ; Mehta, Gaurang ; Moraes, Priscilla ; Silva, Fabio ; Swany, Martin ; Vahi, Karan
  • Author_Institution
    Lawrence Berkeley Nat. Lab., Berkeley, CA, USA
  • fYear
    2011
  • fDate
    24-28 Oct. 2011
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    Scientific workflows are an enabler of complex scientific analyses. They provide both a portable representation and a foundation upon which results can be validated and shared. Large-scale scientific workflows are executed on equally complex parallel and distributed resources, where many things can fail. Application scientists need to track the status of their workflows in real time, detect execution anomalies automatically, and perform troubleshooting - without logging into remote nodes or searching through thousands of log files. As part of the NSF Stampede project, we have developed an infrastructure to answer these needs. The infrastructure captures application-level logs and resource information, normalizes these to standard representations, and stores these logs in a centralized general-purpose schema. Higher-level tools mine the logs in real time to determine current status, predict failures, and detect anomalous performance.
  • Keywords
    parallel processing; scientific information systems; workflow management software; NSF Stampede project; anomalous performance detection; application-level log; centralized general-purpose schema; complex scientific analysis; distributed resources; execution anomalies; higher-level tool; large-scale scientific workflow; online workflow management; parallel resources; performance analysis; resource information; troubleshooting; Broadband communication; Data models; Databases; Monitoring; Real time systems; Servers; USA Councils
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Network and Service Management (CNSM), 2011 7th International Conference on
  • Conference_Location
    Paris
  • Print_ISBN
    978-1-4577-1588-4
  • Electronic_ISBN
    978-3-901882-44-9
  • Type
    conf
  • Filename
    6103988