Title :
Performance of R-GMA for monitoring grid jobs for CMS data production
Author :
Byrom, R. ; Colling, D. ; Fisher, S.M. ; Grandi, C. ; Hobson, P.R. ; Kyberd, P. ; MacEvoy, B. ; Nebrensky, J.J. ; Traylen, S.
Author_Institution :
Particle Phys. Dept., Rutherford Appleton Lab., Chilton
Abstract :
High energy physics experiments, such as the Compact Muon Solenoid (CMS) at the CERN laboratory in Geneva, have large-scale data processing requirements, with data accumulating at a rate of 1 Gbyte/s. This load comfortably exceeds any previous processing requirements and we believe it may be most efficiently satisfied through grid computing. Furthermore the production of large quantities of Monte Carlo simulated data provides an ideal test bed for grid technologies and will drive their development. One important challenge when using the grid for data analysis is the ability to monitor transparently the large number of jobs that are being executed simultaneously at multiple remote sites. R-GMA is a monitoring and information management service for distributed resources based on the grid monitoring architecture of the Global Grid Forum. We have previously developed a system allowing us to test its performance under a heavy load while using few real grid resources. We present the latest results on this system running on the LCG 2 grid test bed using the LCG 2.6.0 middleware release. For a sustained load equivalent to 7 generations of 1000 simultaneous jobs, R-GMA was able to transfer all published messages and store them in a database for 98% of the individual jobs. The failures experienced were at the remote sites, rather than at the archiver´s MON box as had been expected
Keywords :
Monte Carlo methods; grid computing; high energy physics instrumentation computing; middleware; position sensitive particle detectors; CMS data production; Compact Muon Solenoid; Global Grid Forum; LCG 2 grid test bed; LCG 2.6.0 middleware; Monte Carlo simulated data; R-GMA; archiver MON box; distributed resources; grid computing; grid technologies; high energy physics experiments; information management service; monitoring management service; relational grid monitoring architecture; remote sites; Collision mitigation; Data processing; Grid computing; Job production systems; Laboratories; Large-scale systems; Mesons; Remote monitoring; Solenoids; System testing;
Conference_Titel :
Nuclear Science Symposium Conference Record, 2005 IEEE
Conference_Location :
Fajardo
Print_ISBN :
0-7803-9221-3
DOI :
10.1109/NSSMIC.2005.1596391