DocumentCode :
2546483
Title :
Optimal File-Bundle Caching Algorithms for Data-Grids
Author :
Otoo, Ekow ; Rotem, Doron ; Romosan, Alexandru
Author_Institution :
Lawrence Berkeley National Laboratory
fYear :
2004
fDate :
06-12 Nov. 2004
Firstpage :
6
Lastpage :
6
Abstract :
The file-bundle caching problem arises frequently in scientific applications where jobs process several files concurrently. Consider a host system in a data-grid that maintains a disk cache for servicing jobs of file requests where a job is serviced only if all its requested files are present in the disk cache. Files must now be admitted into the cache and replaced in sets of file-bundles. We show that traditional caching algorithms based on file popularity measures do not perform well since they may hold in cache non-relevant combinations of files. We present and analyze a new caching algorithm for maximizing the throughput of jobs and minimizing data replacement costs at such data-grid hosts. We tested the new algorithm using a disk cache simulation model under a wide range of conditions of file request distributions, varying cache size, file size distribution, etc. The results show significant improvement over traditional caching algorithms.
Keywords :
Algorithm design and analysis; Computer networks; Cyclotrons; Distributed computing; Environmental management; High performance computing; Laboratories; Performance evaluation; Permission; Resource management
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Supercomputing, 2004. Proceedings of the ACM/IEEE SC2004 Conference
Print_ISBN :
0-7695-2153-3
Type :
conf
DOI :
10.1109/SC.2004.36
Filename :
1392936