DocumentCode :
580126
Title :
Adaptive and scalable metadata management to support a trillion files
Author :
Jing Xing ; Jin Xiong ; Ninghui Sun ; Jie Ma
Author_Institution :
Nat. Res. Center for Intell. Comput. Syst., Inst. of Comput. Technol., Beijing, China
fYear :
2009
fDate :
14-20 Nov. 2009
Firstpage :
1
Lastpage :
11
Abstract :
Nowadays more and more applications require file systems to efficiently maintain million or more files. How to provide high access performance with such a huge number of files and such large directories is a big challenge for cluster file systems. Limited by static directory structures, existing file systems will be prohibitively inefficient for this use. To address this problem, we present a scalable and adaptive metadata management system which aims to maintain a trillion files efficiently. Firstly, our system exploits an adaptive two-level directory partitioning based on extendible hashing to manage very large directories. Secondly, our system utilizes fine-grained parallel processing within a directory and greatly improves performance of file creation or deletion. Thirdly, our system uses multiple-layered metadata cache management which improves memory utilization on the servers. And finally, our system uses a dynamic loadbalance mechanism based on consistent hashing which enables our system to scale up and down easily. Our performance results on 32 metadata servers show that our user-level prototype implementation can create more than 74 thousand files per second and can get more than 270 thousand files´ attributes per second in a single directory with 100 million files. Moreover, it delivers a peak throughput of more than 60 thousand file creates/second in a single directory with 1 billion files.
Keywords :
cache storage; meta data; parallel processing; storage management; adaptive metadata management system; adaptive two-level directory partitioning; cluster file system; directory management; dynamic load-balance mechanism; file creation; file deletion; hashing; memory utilization; metadata server; multiple-layered metadata cache management; parallel processing; scalable metadata management system; static directory structure; user-level prototype implementation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing Networking, Storage and Analysis, Proceedings of the Conference on
Conference_Location :
Portland, OR
Type :
conf
DOI :
10.1145/1654059.1654086
Filename :
6375575
Link To Document :
بازگشت