Title :
Achieving scalability for job centric monitoring in a distributed infrastructure
Author :
Hilbrich, Marcus ; Müller-Pfefferkorn, Ralph
Author_Institution :
Center for Inf. Services & High Performance Comput. (ZIH), Tech. Univ. Dresden, Dresden, Germany
Abstract :
Job-centric monitoring makes it possible to observe jobs on remote computing resources. It may offer visualisation of recorded monitoring data and helps to find faulty or misbehaving jobs. When installations such as grids or clouds are observed, monitoring data from many thousands of jobs has to be handled. The challenge for job-centric monitoring infrastructures is to store, search and access the data collected in such large installations. We address this challenge with a distributed, layer-based architecture that provides a uniform view of all monitoring data. This paper presents the concept of this infrastructure, called SLAte, and an analysis of its scalability.
Keywords :
cloud computing; data visualisation; grid computing; information retrieval; storage management; SLAte; clouds; data access; data searching; data storage; distributed infrastructure; distributed layer based architecture; faulty jobs; grids; job centric monitoring infrastructures; misbehaving jobs; recorded monitoring data visualization; remote computing resources; Computer architecture; Data visualization; Distributed databases; Monitoring; Scalability; Servers; Time measurement;
Conference_Title :
ARCS Workshops (ARCS), 2012
Conference_Location :
Muenchen
Print_ISBN :
978-1-4673-1913-3
Electronic_ISBN :
978-3-88579-294-9