Title :
Management of grid jobs and data within SAMGrid
Author :
Baranovski, Andrew ; Garzoglio, Gabriele ; Terekhov, Igor ; Roy, Alain ; Tannenbaum, Todd
Author_Institution :
Fermi Nat. Accelerator Lab., Batavia, IL, USA
Abstract :
When designing SAMGrid, a project for distributing high-energy physics computations on a grid, we discovered that it was challenging to decide where to place users' jobs. Jobs typically need to access hundreds of files, and each site holds a different subset of those files. Our data system, SAM, knows what portion of a user's data is available at each site, but does not know how to submit grid jobs. Our job submission system, Condor-G, knows how to submit grid jobs, but originally it required users to choose grid sites and gave them no assistance in choosing. This work describes how we enhanced Condor-G to interact with SAM to make good decisions about where jobs should be executed, and thereby improve the performance of grid jobs that access large amounts of data. These enhancements are general enough to apply to grid computing beyond data-intensive computing with SAMGrid.
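As an illustration of the placement decision described in the abstract, the following minimal Python sketch ranks sites by the fraction of a job's input files already present at each site. It is not the SAMGrid implementation: the function name `rank_sites`, the `site_catalog` mapping, and the file and site names are hypothetical, and the actual interaction between Condor-G and SAM is not specified in this record.

```python
def rank_sites(job_files, site_catalog):
    """Order sites by how much of a job's input data they already hold.

    job_files:    iterable of file names the job needs (hypothetical input).
    site_catalog: dict mapping site name -> collection of file names at that
                  site (a stand-in for what a data system could report).
    Returns site names, best data locality first; ties broken by name.
    """
    needed = set(job_files)
    if not needed:
        # A job with no input files has no data-locality preference.
        return sorted(site_catalog)

    # Score each site by the fraction of needed files it already holds.
    scores = {
        site: len(needed & set(files)) / len(needed)
        for site, files in site_catalog.items()
    }
    # Highest fraction first; alphabetical order keeps the result deterministic.
    return sorted(scores, key=lambda site: (-scores[site], site))


if __name__ == "__main__":
    catalog = {
        "site_a": {"f1", "f2"},
        "site_b": {"f1", "f2", "f3"},
        "site_c": set(),
    }
    print(rank_sites(["f1", "f2", "f3", "f4"], catalog))
```

A job submission layer could take the first entry of the returned list as its preferred execution site, falling back to later entries when the preferred site is unavailable.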
Keywords :
grid computing; high energy physics instrumentation computing; middleware; scheduling; storage management; Condor-G; SAMGrid; data intensive computing; data management; decision making; file access; grid distribution; grid job management; high-energy physics computation; job execution; job submission system; planning; user job; Access protocols; Collaboration; Data handling; Information management; Job design; Laboratories; Physics; Processor scheduling; Resource management
Conference_Title :
Cluster Computing, 2004 IEEE International Conference on
Print_ISBN :
0-7803-8694-9
DOI :
10.1109/CLUSTR.2004.1392634