DocumentCode :
720568
Title :
Towards Elasticity in Distributed File Systems
Author :
Seguin, Cyril ; Le Mahec, Gael ; Depardon, Benjamin
Author_Institution :
MIS Lab., UPJV, Villeurbanne, France
fYear :
2015
fDate :
4-7 May 2015
Firstpage :
1047
Lastpage :
1056
Abstract :
In IaaS Cloud Computing platforms, elasticity offers to users the possibility to adjust the number of resources tithe current workload, taking into account peak (high activity) and trough (low activity) periods by powering down/up someresources. This elasticity principally consists in dynamically starting/stopping Virtual Machines to increase/reduce the computing capacities. In this paper, we study the problem of storage resources elasticity using a Distributed File System (DFS). Indeed, the main DFSes that are used today, like Glister, HDFS or Lustre, focus on data access performance and transient fault tolerance, but they do not take into account intentional and dynamic removal/addition of resources. We propose an algorithm that provides an initial data placement and adapts the resources of a DFS to the workload of a platform maintaining good data access performance. This algorithm takes into account an estimated popularity of the data stored on the DFS to distribute the workload using the least resources. Our simulations on top of SimGrid show that we could improve HDFS´ performances by up to 41% while using less resources (up to 63%). Then, we introduce our solutions that adapt the replication factor of each data to their popularity, allowing more parallel accesses when needed and easing the removal of resources when some data have few concurrent accesses. The simulations show we could either reduce the number of used resources (up to 35%) or increase performances (up to 54%).
Keywords :
cloud computing; distributed databases; fault tolerant computing; resource allocation; virtual machines; Gluster; HDFS; IaaS cloud computing platforms; Lustre; SimGrid; computing capacities; data access performance; data placement; distributed file systems; dynamic resource removal-addition; intentional resource removal-addition; parallel accesses; replication factor; transient fault tolerance; virtual machines; Bandwidth; Cloud computing; Distributed databases; Elasticity; Fault tolerance; Rabbits; Servers; Data Popularity; Distributed File System; Elasticity; Read Performances;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
Conference_Location :
Shenzhen
Type :
conf
DOI :
10.1109/CCGrid.2015.140
Filename :
7152591
Link To Document :
بازگشت