مرکز منطقه ای اطلاع رساني علوم و فناوري - Towards Elasticity in Distributed File Systems

DocumentCode :

720568

Title :

Towards Elasticity in Distributed File Systems

Author :

Seguin, Cyril ; Le Mahec, Gael ; Depardon, Benjamin

Author_Institution :

MIS Lab., UPJV, Villeurbanne, France

fYear :

2015

fDate :

4-7 May 2015

Firstpage :

1047

Lastpage :

1056

Abstract :

In IaaS Cloud Computing platforms, elasticity offers to users the possibility to adjust the number of resources tithe current workload, taking into account peak (high activity) and trough (low activity) periods by powering down/up someresources. This elasticity principally consists in dynamically starting/stopping Virtual Machines to increase/reduce the computing capacities. In this paper, we study the problem of storage resources elasticity using a Distributed File System (DFS). Indeed, the main DFSes that are used today, like Glister, HDFS or Lustre, focus on data access performance and transient fault tolerance, but they do not take into account intentional and dynamic removal/addition of resources. We propose an algorithm that provides an initial data placement and adapts the resources of a DFS to the workload of a platform maintaining good data access performance. This algorithm takes into account an estimated popularity of the data stored on the DFS to distribute the workload using the least resources. Our simulations on top of SimGrid show that we could improve HDFS´ performances by up to 41% while using less resources (up to 63%). Then, we introduce our solutions that adapt the replication factor of each data to their popularity, allowing more parallel accesses when needed and easing the removal of resources when some data have few concurrent accesses. The simulations show we could either reduce the number of used resources (up to 35%) or increase performances (up to 54%).

Keywords :

cloud computing; distributed databases; fault tolerant computing; resource allocation; virtual machines; Gluster; HDFS; IaaS cloud computing platforms; Lustre; SimGrid; computing capacities; data access performance; data placement; distributed file systems; dynamic resource removal-addition; intentional resource removal-addition; parallel accesses; replication factor; transient fault tolerance; virtual machines; Bandwidth; Cloud computing; Distributed databases; Elasticity; Fault tolerance; Rabbits; Servers; Data Popularity; Distributed File System; Elasticity; Read Performances;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on

Conference_Location :

Shenzhen

Type :

conf

DOI :

10.1109/CCGrid.2015.140

Filename :

7152591

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=720568