• DocumentCode
    187014
  • Title

    ZooFence: Principled Service Partitioning and Application to the ZooKeeper Coordination Service

  • Author

    Halalai, Raluca ; Sutra, Pierre ; Riviere, Etienne ; Felber, Pascal

  • Author_Institution
    Univ. of Neuchatel, Neuchatel, Switzerland
  • fYear
    2014
  • fDate
    6-9 Oct. 2014
  • Firstpage
    67
  • Lastpage
    78
  • Abstract
    Cloud computing infrastructures leverage fault-tolerant and geographically distributed services in order to meet the requirements of modern applications. Each service deals with a large number of clients that compete for the resources it offers. When the load increases, the service needs to scale. In this paper, we investigate a scalability solution which consists in partitioning the service state. We formulate specific conditions under which a service is partitionable. Then, we present a general algorithm to build a dependable and consistent partitioned service. To assess the practicability of our approach, we implement and evaluate the ZooFence coordination service. ZooFence orchestrates several instances of ZooKeeper and presents the exact same API and semantics to its clients. It automatically splits the coordination service state among ZooKeeper instances while being transparent to the application. By reducing the convoy effect on operations and leveraging the workload locality, our approach allows proposing a coordination service with a greater scalability than with a single ZooKeeper instance. The evaluation of ZooFence assesses this claim for two benchmarks, a synthetic service of concurrent queues and the BookKeeper distributed logging engine.
  • Keywords
    application program interfaces; cloud computing; queueing theory; API; BookKeeper distributed logging engine; ZooFence; ZooKeeper coordination service; automatically coordinated service state splitting; cloud computing infrastructures; concurrent queues; convoy effect reduction; dependable consistent partitioned service; fault-tolerant services; general algorithm; geographically distributed services; principled service state partitioning; service scalability; service semantics; synthetic service; workload locality leveraging; Banking; History; Nominations and elections; Parallel processing; Partitioning algorithms; Semantics; Synchronization; coordination service; state partitioning; zookeeper;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reliable Distributed Systems (SRDS), 2014 IEEE 33rd International Symposium on
  • Conference_Location
    Nara
  • Type

    conf

  • DOI
    10.1109/SRDS.2014.41
  • Filename
    6983381