Title :
HPC Cluster Monitoring System Architecture Design and Implement
Author :
Li, Min ; Zhang, Yisheng
Author_Institution :
State Key Lab. of Mater. Process. & Die & Mould Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
Abstract :
High performance computing (HPC) facilities such as HPC clusters, as building blocks of Grid computing, are playing an important role in computational Grid. HPC monitoring in HPC cluster systems presents an important challenge because HPC cluster environments are volatile, heterogeneous, not reliable and are managed by different middleware and systems. In this paper, we propose an HPC cluster monitoring system with four tier structure for Grid computing and utility computing clusters. It provides the basic function such as job monitoring, and system monitoring, etc. With our prototype, the Grid users are able to find the available cluster nodes, and customize their preferred HPC cluster nodes for their computation intensive applications in Grid computing or utility computing.. Experiments show that our work provide great convenience and flexibility for users to make good use of HPC cluster.
Keywords :
grid computing; monitoring; software architecture; cluster monitoring system architecture; grid computing; high performance computing; Computer architecture; Distributed computing; Grid computing; High performance computing; Laboratories; Materials processing; Materials science and technology; Monitoring; Prototypes; Web services; Ganglia; High Performance Computing; Monitoring; Web Service;
Conference_Titel :
Intelligent Computation Technology and Automation, 2009. ICICTA '09. Second International Conference on
Conference_Location :
Changsha, Hunan
Print_ISBN :
978-0-7695-3804-4
DOI :
10.1109/ICICTA.2009.314