Title :
Monitoring High Performance Computing Systems for the End User
Author :
Christopher Lee Moore;Prabhu Singh Khalsa;Todd Alan Yilk;Michael Mason
Author_Institution :
High Performance Computing Group 3, Los Alamos National Laboratory, Los Alamos, NM, USA
Abstract :
Monitoring of High Performance Computing clusters is currently geared towards giving system administrators the information they need to make informed decisions about the resources used in the cluster. This emphasis leaves out the End Users, those who apply the cluster's resources to projects and programs, because they are given no information about how their workflows affect the cluster. Providing End Users with a subset of the monitoring data, in a format they can easily interpret and use, can help them make better use of the computing resources provided to them.
Keywords :
"Monitoring","File systems","High performance computing","Routing","Databases","Documentation","Measurement"
Conference_Title :
Cluster Computing (CLUSTER), 2015 IEEE International Conference on
DOI :
10.1109/CLUSTER.2015.124