• DocumentCode
    580126
  • Title

    Adaptive and scalable metadata management to support a trillion files

  • Author

    Jing Xing ; Jin Xiong ; Ninghui Sun ; Jie Ma

  • Author_Institution
    Nat. Res. Center for Intell. Comput. Syst., Inst. of Comput. Technol., Beijing, China
  • fYear
    2009
  • fDate
    14-20 Nov. 2009
  • Firstpage
    1
  • Lastpage
    11
  • Abstract
    Nowadays more and more applications require file systems to efficiently maintain million or more files. How to provide high access performance with such a huge number of files and such large directories is a big challenge for cluster file systems. Limited by static directory structures, existing file systems will be prohibitively inefficient for this use. To address this problem, we present a scalable and adaptive metadata management system which aims to maintain a trillion files efficiently. Firstly, our system exploits an adaptive two-level directory partitioning based on extendible hashing to manage very large directories. Secondly, our system utilizes fine-grained parallel processing within a directory and greatly improves performance of file creation or deletion. Thirdly, our system uses multiple-layered metadata cache management which improves memory utilization on the servers. And finally, our system uses a dynamic loadbalance mechanism based on consistent hashing which enables our system to scale up and down easily. Our performance results on 32 metadata servers show that our user-level prototype implementation can create more than 74 thousand files per second and can get more than 270 thousand files´ attributes per second in a single directory with 100 million files. Moreover, it delivers a peak throughput of more than 60 thousand file creates/second in a single directory with 1 billion files.
  • Keywords
    cache storage; meta data; parallel processing; storage management; adaptive metadata management system; adaptive two-level directory partitioning; cluster file system; directory management; dynamic load-balance mechanism; file creation; file deletion; hashing; memory utilization; metadata server; multiple-layered metadata cache management; parallel processing; scalable metadata management system; static directory structure; user-level prototype implementation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing Networking, Storage and Analysis, Proceedings of the Conference on
  • Conference_Location
    Portland, OR
  • Type

    conf

  • DOI
    10.1145/1654059.1654086
  • Filename
    6375575