DocumentCode :
1188403
Title :
Scalability tests of R-GMA-based grid job monitoring system for CMS Monte Carlo data production
Author :
Bonacorsi, D. ; Colling, D. ; Field, L. ; Fisher, S.M. ; Grandi, C. ; Hobson, P.R. ; Kyberd, P. ; MacEvoy, B. ; Nebrensky, J.J. ; Tallini, H. ; Traylen, S.
Author_Institution :
Inst. Nazionale di Fisica Nucl.e, Bologna, Italy
Volume :
51
Issue :
6
fYear :
2004
Firstpage :
3026
Lastpage :
3029
Abstract :
High-energy physics experiments, such as the compact muon solenoid (CMS) at the large hadron collider (LHC), have large-scale data processing computing requirements. The grid has been chosen as the solution. One important challenge when using the grid for large-scale data processing is the ability to monitor the large numbers of jobs that are being executed simultaneously at multiple remote sites. The relational grid monitoring architecture (R-GMA) is a monitoring and information management service for distributed resources based on the GMA of the Global Grid Forum. We report on the first measurements of R-GMA as part of a monitoring architecture to be used for batch submission of multiple Monte Carlo simulation jobs running on a CMS-specific LHC computing grid test bed. Monitoring information was transferred in real time from remote execution nodes back to the submitting host and stored in a database. In scalability tests, the job submission rates supported by successive releases of R-GMA improved significantly, approaching that expected in full-scale production.
Keywords :
Monte Carlo methods; database management systems; grid computing; high energy physics instrumentation computing; information management; CMS Monte Carlo data production; CMS-specific LHC computing grid test bed; LHC; R-GMA-based grid job monitoring system; batch submission; compact muon solenoid; database; distributed resources; global grid forum; high-energy physics experiments; information management service; job submission rates; jobs execution; large hadron collider; large-scale data processing computing requirements; monitoring information; multiple Monte Carlo simulation jobs; multiple remote sites; relational grid monitoring architecture; remote execution nodes; scalability tests; Collision mitigation; Computer architecture; Data processing; Job production systems; Large Hadron Collider; Large-scale systems; Monte Carlo methods; Remote monitoring; Scalability; System testing;
fLanguage :
English
Journal_Title :
Nuclear Science, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9499
Type :
jour
DOI :
10.1109/TNS.2004.839094
Filename :
1369428
Link To Document :
بازگشت