DocumentCode :
2235922
Title :
Filecules in High-Energy Physics: Characteristics and Impact on Resource Management
Author :
Aamnitchi, A. ; Doraimani, Shyamala ; Garzoglio, Gabriele
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL
fYear :
0
fDate :
0-0 0
Firstpage :
69
Lastpage :
80
Abstract :
Grid computing has reached the stage where deployments are mature and many collaborations run in production mode. Mature grid deployments offer the opportunity for revisiting and perhaps updating traditional beliefs related to workload models, which in turn leads to the re-evaluation of traditional resource management techniques. This paper analyzes usage patterns in a typical grid community, a large-scale data-intensive scientific collaboration in high-energy physics. We focus mainly on data usage, since data is the major resource for this class of applications. Our observations led us to propose a new abstraction for resource management in scientific data analysis applications: we define a filecule as a group of files that is always used together. We show that filecules exist and present their characteristics. The existence of filecules suggests a new granularity for data management, which, if incorporated in design, can significantly outperform the traditional solutions for data caching, replication and placement based on single-file granularity. We reason about the impact of filecules on resource management and show compelling evidence for using this abstraction when designing data management services
Keywords :
database management systems; grid computing; resource allocation; scientific information systems; storage management; data caching; data management service; data replication; grid computing; high-energy physics; large-scale data-intensive scientific analysis application; resource management; single-file granularity; Collaborative work; Computer science; Data analysis; Engineering management; Grid computing; Laboratories; Large-scale systems; Pattern analysis; Physics; Resource management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Distributed Computing, 2006 15th IEEE International Symposium on
Conference_Location :
Paris
ISSN :
1082-8907
Print_ISBN :
1-4244-0307-3
Type :
conf
DOI :
10.1109/HPDC.2006.1652137
Filename :
1652137
Link To Document :
بازگشت