Title :
Dynamic Adaptation for Elastic System Services Using Virtual Servers
Author :
Abhishek Kulkarni;Hugh Greenberg;Michael Lang;Andrew Lumsdaine
Abstract :
A vast majority of legacy runtime systems and middleware prevalent in cluster and supercomputing environments are static in nature. Due to the rising scale and complexity of high-performance computing systems, the static nature of systems software would prospectively impede its scalability and resilience. Traditionally, the mobility of servers is further limited since services are statically bound to specific communication endpoints. To address these challenges imminent for exascale-class systems, distributed middleware needs to support dynamic reconfiguration, redundant and replicated state, and adaptation where the number of servers can vary according to the load in the system. We identify the key features necessary from the underlying network infrastructure to support dynamic adaptation and elasticity in distributed system software, and describe the implementation of a high-performance middleware library that implements the proposed interface. We discuss several novel approaches for dynamic resolution using range computations performed by hosts (in software) and by switches (in hardware), and compare the performance on contemporary Ethernet networks. Finally, we validate the benefits offered by our library with two different applications -- a scalable DHCP server and an elastic key-value store.
Keywords :
"Servers","Libraries","System software","Resilience","Complexity theory","Elasticity","Routing"
Conference_Titel :
High Performance Computing (HiPC), 2015 IEEE 22nd International Conference on
DOI :
10.1109/HiPC.2015.46