Title :
AOCMS: An Adaptive and Scalable Monitoring System for Large-Scale Clusters
Author :
Xue, Zhenghua ; Dong, Xiaoshe ; Wu, Weiguo
Author_Institution :
Dept. of Comput. Sci. & Technol., Xi'an Jiaotong Univ.
Abstract :
In this paper, we present the design and implementation of AOCMS, an adaptive, scalable, and efficient monitoring system for large-scale clusters. We describe the adaptive architecture of AOCMS in detail and discuss the techniques that enhance its adaptability, scalability, and efficiency. These techniques include: a solution for monitoring a heterogeneous cluster; a universal applet-servlet communicating controller responsible for communication between the clients and the Web server; adaptive pools that provide threads or database connections to the monitoring tasks on demand; and an AOP-based alarm that decouples the alarming logic from the monitoring logic. We also measured the performance of AOCMS. The results show that AOCMS runs with low overhead and responds to clients quickly.
Keywords :
client-server systems; distributed programming; object-oriented programming; workstation clusters; AOCMS; AOP-based alarm decoupling; Web server; adaptive architecture; adaptive scalable monitoring system; applet-servlet communicating controller; large-scale clusters; Adaptive control; Communication system control; Databases; Large-scale systems; Logic; Monitoring; Programmable control; Scalability; Yarn
Conference_Title :
Services Computing, 2006. APSCC '06. IEEE Asia-Pacific Conference on
Conference_Location :
Guangzhou, Guangdong
Print_ISBN :
0-7695-2751-5
DOI :
10.1109/APSCC.2006.34