Title :
CRUSH: Controlled, Scalable, Decentralized Placement of Replicated Data
Author :
Weil, Sage A. ; Brandt, Scott A. ; Miller, Ethan L. ; Maltzahn, Carlos
Author_Institution :
Storage Syst. Res. Center, California Univ., Santa Cruz, CA
Abstract :
Emerging large-scale distributed storage systems are faced with the task of distributing petabytes of data among tens or hundreds of thousands of storage devices. Such systems must evenly distribute data and workload to efficiently utilize available resources and maximize system performance, while facilitating system growth and managing hardware failures. We have developed CRUSH, a scalable pseudorandom data distribution function designed for distributed object-based storage systems that efficiently maps data objects to storage devices without relying on a central directory. Because large systems are inherently dynamic, CRUSH is designed to facilitate the addition and removal of storage while minimizing unnecessary data movement. The algorithm accommodates a wide variety of data replication and reliability mechanisms and distributes data in terms of user-defined policies that enforce separation of replicas across failure domains
Keywords :
distributed object management; meta data; replicated databases; storage management; CRUSH; distributed object-based storage systems; replicated data; scalable pseudorandom data distribution function; storage devices; user-defined policies; Aggregates; Control systems; Distribution functions; Hardware; Large-scale systems; Potential well; Resource management; Robustness; Scalability; System performance;
Conference_Titel :
SC 2006 Conference, Proceedings of the ACM/IEEE
Conference_Location :
Tampa, FL
Print_ISBN :
0-7695-2700-0
Electronic_ISBN :
0-7695-2700-0