• DocumentCode
    625635
  • Title

    ZHT: A Light-Weight Reliable Persistent Dynamic Scalable Zero-Hop Distributed Hash Table

  • Author

    Tonglin Li ; Xiaobing Zhou ; Brandstatter, Kevin ; Dongfang Zhao ; Ke Wang ; Rajendran, Adithya ; Zhao Zhang ; Raicu, Ioan

  • Author_Institution
    Dept. of Comput. Sci., Illinois Inst. of Technol., Chicago, IL, USA
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    775
  • Lastpage
    787
  • Abstract
    This paper presents ZHT, a zero-hop distributed hash table, which has been tuned for the requirements of high-end computing systems. ZHT aims to be a building block for future distributed systems, such as parallel and distributed file systems, distributed job management systems, and parallel programming systems. The goals of ZHT are delivering high availability, good fault tolerance, high throughput, and low latencies, at extreme scales of millions of nodes. ZHT has some important properties, such as being light-weight, dynamically allowing nodes join and leave, fault tolerant through replication, persistent, scalable, and supporting unconventional operations such as append (providing lock-free concurrent key/value modifications) in addition to insert/lookup/remove. We have evaluated ZHT´s performance under a variety of systems, ranging from a Linux cluster with 512-cores, to an IBM Blue Gene/P supercomputer with 160K-cores. Using micro-benchmarks, we scaled ZHT up to 32K-cores with latencies of only 1.1ms and 18M operations/sec throughput. This work provides three real systems that have integrated with ZHT, and evaluate them at modest scales. 1) ZHT was used in the FusionFS distributed file system to deliver distributed meta-data management at over 60K operations (e.g. file create) per second at 2K-core scales. 2) ZHT was used in the IStore, an information dispersal algorithm enabled distributed object storage system, to manage chunk locations, delivering more than 500 chunks/sec at 32-nodes scales. 3) ZHT was also used as a building block to MATRIX, a distributed job scheduling system, delivering 5000 jobs/sec throughputs at 2K-core scales. We compared ZHT against other distributed hash tables and key/value stores and found it offers superior performance for the features and portability it supports.
  • Keywords
    Linux; distributed databases; fault tolerant computing; meta data; parallel machines; performance evaluation; processor scheduling; storage management; FusionFS distributed file system; IBM Blue Gene/P supercomputer; IStore; Linux cluster; MATRIX; ZHT performance evaluation; distributed job scheduling system; distributed meta data management; distributed object storage system; fault tolerance; high-end computing systems; information dispersal algorithm; key-value stores; microbenchmarks; throughput; zero-hop distributed hash table; Fault tolerance; Fault tolerant systems; File systems; Generators; Routing; Servers; Throughput; Distributed hash tables; high-end computing; key/value stores;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
  • Conference_Location
    Boston, MA
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4673-6066-1
  • Type

    conf

  • DOI
    10.1109/IPDPS.2013.110
  • Filename
    6569861