DocumentCode
2540993
Title
Management of grid jobs and data within SAMGrid
Author
Baranovski, Andrew ; Garzoglio, Gabriele ; Terekhov, Igor ; Roy, Alain ; Tannenbaum, Todd
Author_Institution
Fermi Nat. Accelerator Lab., Batavia, IL, USA
fYear
2004
fDate
20-23 Sept. 2004
Firstpage
353
Lastpage
359
Abstract
When designing SAMGrid, a project for distributing high-energy physics computations on a grid, we discovered that it was challenging to decide where to place user's jobs. Jobs typically need to access hundreds of files, and each site has a different subset of the files. Our data system SAM knows what portion of a user's data may be at each site, but does not know how to submit grid jobs. Our job submission system Condor-G knows how to submit grid jobs, but originally it required users to choose grid sites and gave them no assistance in choosing. This work describes how we enhanced Condor-G to interact with SAM to make good decisions about where jobs should be executed, and thereby improve the performance of grid jobs that access large amounts of data. All these enhancements are general enough to be applicable to grid computing beyond the data-intensive computing with SAMGrid.
Keywords
grid computing; high energy physics instrumentation computing; middleware; scheduling; storage management; Condor-G; SAMGrid; data intensive computing; data management; decision making; file access; grid distribution; grid job management; high-energy physics computation; job execution; job submission system; planning; user job; Access protocols; Collaboration; Data handling; Information management; Job design; Laboratories; Physics; Processor scheduling; Resource management
fLanguage
English
Publisher
IEEE
Conference_Titel
Cluster Computing, 2004 IEEE International Conference on
ISSN
1552-5244
Print_ISBN
0-7803-8694-9
Type
conf
DOI
10.1109/CLUSTR.2004.1392634
Filename
1392634