Title :
Supermon: a high-speed cluster monitoring system
Author :
Sottile, Matthew J. ; Minnich, Ronald G.
Author_Institution :
Adv. Comput. Lab., Los Alamos Nat. Lab., NM, USA
Abstract :
Supermon is a flexible set of tools for high-speed, scalable cluster monitoring. Node behavior can be monitored much faster than with other commonly used methods (e.g., rstatd). In addition, Supermon uses a data protocol based on symbolic expressions (S-expressions) at all levels, from individual nodes to entire clusters. This contributes to Supermon's scalability and allows it to function in a heterogeneous environment. This paper presents the Supermon architecture and discusses initial performance measurements on a cluster of heterogeneous Alpha-processor based nodes.
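To illustrate why an S-expression protocol is easy to consume on heterogeneous machines, the following is a minimal Python sketch of a parser for a textual S-expression. It is not Supermon's implementation or wire format: the sample message and its field names (`node`, `name`, `cpu`, `user`, `idle`, `mem`, `free`) are hypothetical, chosen only for illustration.

```python
def tokenize(text):
    """Split an S-expression string into parentheses and atoms."""
    return text.replace("(", " ( ").replace(")", " ) ").split()


def parse(tokens):
    """Build a nested list from a token list, consuming tokens from the front."""
    if not tokens:
        raise ValueError("unexpected end of input")
    token = tokens.pop(0)
    if token == "(":
        items = []
        while tokens and tokens[0] != ")":
            items.append(parse(tokens))
        if not tokens:
            raise ValueError("missing closing parenthesis")
        tokens.pop(0)  # discard the matching ")"
        return items
    if token == ")":
        raise ValueError("unexpected closing parenthesis")
    # Atoms: integers become int, everything else stays a string.
    try:
        return int(token)
    except ValueError:
        return token


# Hypothetical sample message: one node reporting a few metrics.
sample = "(node (name n01) (cpu (user 1200) (idle 98000)) (mem (free 51200)))"
tree = parse(tokenize(sample))
print(tree)
```

Because S-expressions nest, the same parser would also handle a cluster-level message that wraps several such node expressions in one outer list, and the plain-text encoding avoids byte-order and word-size differences between architectures.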
Keywords :
client-server systems; monitoring; performance evaluation; protocols; workstation clusters; Supermon; data protocol; heterogeneous Alpha-processor based node cluster; heterogeneous environment; high speed scalable cluster monitoring; node behavior monitoring; performance measurements; symbolic expressions
Conference_Title :
Cluster Computing, 2002. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-2066-9
DOI :
10.1109/CLUSTR.2002.1137727