DocumentCode :
3575215
Title :
Evolution Towards Distributed Storage in a Nutshell
Author :
Andrei, Pistirica Sorin ; Victor, Asavei ; Horia, Geanta ; Florica, Moldoveanu ; Alin, Moldoveanu ; Catalin, Negru ; Mariana, Mocanu
Author_Institution :
Fac. of Autom. Control & Comput., Univ. Politeh. of Bucharest, Bucharest, Romania
fYear :
2014
Firstpage :
1267
Lastpage :
1274
Abstract :
Distributed storage systems have greatly evolveddue to cloud computing upsurge in the past several years. Thedistributed file systems inherit many components fromcentralized ones and use them in a distributed manner. There aretwo ways to grow the storage capacity: by scaling-up or byscaling-out and growing the number of storage devices in astorage system. The growth of storage devices impose manychallenges related to interconnection protocols and topologies,error handling, data consistency, security and so on. In thisarticle we have studied how distributed and parallel storageshave evolved from direct connected storages in terms ofarchitecture, data management and organization and how thenew challenges imposed by data distribution have been solved.We have selected for studying several of the most representativedistributed storages solutions: Andrew File System, Google FileSystem, General Parallel File System, Lustre and Ceph. First, weemphasize how a generic distributed storage layout has inspiredfrom structured disk layout (Berkeley Fast File System). Second,we describe the evolution path of distributed storages from awide variety of perspectives, including: distributed units whichare moving from blocks to objects due to their undeniableadvantages or distribution methods that have evolved from listsmuch like inode mapping to deterministic hash functions likeRUSH or CRUSH. Third, the networks are evolving very fast interms of topologies and protocols. Using graph theory,researchers are continuously improving different aspects ofcluster networks. Fourth, storage security is a critical componentdue to the demand of storing sensitive data for long term, sharingit in a secure way and impacting as little as possible the systemperformance.
Keywords :
distributed databases; file organisation; Andrew File System; Berkeley Fast File System; CRUSH; Ceph; General Parallel File System; Google File System; Lustre; Nutshell; RUSH; cloud computing; cluster networks; data distribution; data management; deterministic hash functions; direct connected storages; distributed file systems; distributed storage systems; distributed units; distribution methods; generic distributed storage layout; graph theory; inode mapping; parallel storage; scaling-out; scaling-up; sensitive data storage; storage capacity; storage devices; storage security; structured disk layout; system architecture; system organization; Availability; Distributed databases; File systems; Organizations; Security; Semantics; Servers; Data Center; Distributed File System; File System;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS), 2014 IEEE Intl Conf on
Print_ISBN :
978-1-4799-6122-1
Type :
conf
DOI :
10.1109/HPCC.2014.216
Filename :
7056906
Link To Document :
بازگشت