Title :
OGSA-based grid workload monitoring
Author :
Zhang, Rui ; Moyle, Steve ; McKeever, Steve ; Heisig, Stephen
Author_Institution :
Comput. Lab., Oxford Univ., UK
Abstract :
In heterogeneous and dynamic distributed systems like the grid, detailed monitoring of workload and its resulting system performance (e.g. response time) is required to facilitate performance diagnosis and adaptive performance tuning. In this paper, we present a workload monitoring infrastructure for this purpose. The infrastructure classifies and monitors workload across components in grids based on the open grid service architecture (OGSA) in an end-to-end manner. It provides the abilities to assess what components are involved in processing a work unit, to report time elapsed at these components, and to capture concurrency and isolate which components are critical to overall performance observed. These are enclosed in an automatically constructed Response Time Service Petri Net (RT-SPN) model. A tool is provided to accept queries about work units and visualise corresponding RTSPNs. The infrastructure is also designed and implemented so as to be portable, scalable and lightweight.
Keywords :
Petri nets; grid computing; monitoring; open systems; performance evaluation; OGSA; Response Time Service Petri Net model; adaptive performance tuning; grid workload monitoring; heterogeneous distributed systems; open grid service architecture; Concurrent computing; Delay; Distributed computing; Grid computing; Laboratories; Monitoring; Service oriented architecture; System performance; Visualization; Web services;
Conference_Titel :
Cluster Computing and the Grid, 2005. CCGrid 2005. IEEE International Symposium on
Print_ISBN :
0-7803-9074-1
DOI :
10.1109/CCGRID.2005.1558628