Title :
File Clustering Based Replication Algorithm in a Grid Environment
Author :
Sato, Hitoshi ; Matsuoka, Satoshi ; Endo, Toshio
Author_Institution :
Tokyo Inst. of Technol., Tokyo
Abstract :
Replication in grid file systems can significantly improve I/O performance of data-intensive applications. However, most of existing replication techniques apply to individual files, which may introduce inefficient replication overheads for a large number of files. We propose a file clustering based replication algorithm for grid file systems. Our algorithm groups files according to a relationship of simultaneous accesses between files and stores replicas of the clustered files into storage nodes, to satisfy expected most of future read access times to the clustered files and replication times for individual files being minimized under the given storage capacity limitation. Our experiments on a given grid environment, 20 nodes of 5 sites, suggest that the proposed algorithm achieves accurate file clustering and efficient replica management; our clustering policy with the file cluster size limit of 5120 MB and the storage capacity limit for replicas of 10240 MB exhibits 1.58 times efficiency than the policy that never groups related files. The results also indicate that the overheads required for introducing our algorithm significantly affect I/O performance of running applications.
Keywords :
grid computing; pattern clustering; storage management; I/O performance; data-intensive application; file clustering; grid environment; replication algorithm; storage capacity limit; Clustering algorithms; Degradation; Environmental management; File systems; Grid computing; Informatics; Integer linear programming; Large-scale systems; Physics; Processor scheduling; Data-Intensive Computing; File System; Grid; Resource Management;
Conference_Titel :
Cluster Computing and the Grid, 2009. CCGRID '09. 9th IEEE/ACM International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-3935-5
Electronic_ISBN :
978-0-7695-3622-4
DOI :
10.1109/CCGRID.2009.73