DocumentCode
451277
Title
An Efficient Data Location Protocol for Self.organizing Storage Clusters
Author
Tang, Hong ; Yang, Tao
Author_Institution
University of California, Santa Barbara
fYear
2003
fDate
15-21 Nov. 2003
Firstpage
53
Lastpage
53
Abstract
Component additions and failures are common for large-scale storage clusters in production environments. To improve availability and manageability, we investigate and compare data location schemes for a large self-organizing storage cluster that can quickly adapt to the additions or departures of storage nodes. We further present an efficient location scheme that differentiates between small and large file blocks for reduced management overhead compared to uniform strategies. In our protocol, small blocks, which are typically in large quantities, are placed through consistent hashing. Large blocks, much fewer in practice, are placed through a usage-based policy, and their locations are tracked by Bloom filters. The proposed scheme results in improved storage utilization even with non-uniform cluster nodes. To achieve high scalability and fault resilience, this protocol is fully distributed, relies only on soft states, and supports data replication. We demonstrate the effectiveness and efficiency of this protocol through trace-driven simulation.
Keywords
Application software; Availability; Computer science; Image storage; Large-scale systems; Local area networks; Peer to peer computing; Permission; Production; Protocols;
fLanguage
English
Publisher
ieee
Conference_Titel
Supercomputing, 2003 ACM/IEEE Conference
Print_ISBN
1-58113-695-1
Type
conf
DOI
10.1109/SC.2003.10003
Filename
1592956
Link To Document