DocumentCode :
3503460
Title :
Multi-Replication with Intelligent Staging in Data-Intensive Grid Applications
Author :
Machida, Yuya ; Takizawa, Shun ; Nakada, Hidemoto ; Matsuoka, Satoshi
Author_Institution :
Tokyo Inst. of Technol.
fYear :
2006
fDate :
28-29 Sept. 2006
Firstpage :
88
Lastpage :
95
Abstract :
Existing data grid scheduling systems handle huge data I/O via replica location services coupled with simple staging, decoupled from the scheduling of computing tasks. However, when the application/workflow scales, we observe considerable performance degradation compared to processing within a tightly coupled cluster. For example, when numerous nodes access the same set of files simultaneously, major performance degradation occurs even if replicas are used, due to bottlenecks that manifest in the infrastructure. Instead of resorting to expensive solutions such as parallel file systems, we propose alleviating the situation by tightly coupling replica and data transfer management with computation scheduling. In particular, we propose three techniques: (1) dynamic aggregation and O(1) replication of data-staging requests across multiple nodes using a multi-replication framework, (2) replica-centric scheduling, which uses data re-use and time-to-replication as compute scheduling metrics on the grid, and (3) overlapped execution of data staging and compute-bound tasks. Early benchmark results from our prototype Condor-like grid scheduling system demonstrate that the techniques are quite effective in eliminating much of the data transfer overhead in many cases.
Keywords :
grid computing; replicated databases; scheduling; computation scheduling; data grid scheduling systems; data reuse; data transfer management; data-intensive grid applications; data-staging request replication; dynamic aggregation; intelligent staging; multireplication framework; replica location services; replica-centric scheduling; time-to-replication; Computer industry; Concurrent computing; Databases; Degradation; Dynamic scheduling; File systems; Grid computing; Informatics; Job shop scheduling; Processor scheduling;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Grid Computing, 7th IEEE/ACM International Conference on
Conference_Location :
Barcelona
Print_ISBN :
1-4244-0343-X
Electronic_ISBN :
1-4244-0344-8
Type :
conf
DOI :
10.1109/ICGRID.2006.311002
Filename :
4100459
Link To Document :