DocumentCode
668152
Title
Towards high-performance and cost-effective distributed storage systems with information dispersal algorithms
Author
Dongfang Zhao ; Burlingame, Kent ; Debains, Corentin ; Alvarez-Tabio, Pedro ; Raicu, Ioan
Author_Institution
Illinois Inst. of Technol., Chicago, IL, USA
fYear
2013
fDate
23-27 Sept. 2013
Firstpage
1
Lastpage
5
Abstract
Reliability is one of the most fundamental challenges for high performance computing (HPC) and cloud computing. Data replication is the de facto mechanism to achieve high reliability, even though it has been criticized for its high cost and low efficiency. Recent research showed promising results by switching the traditional data replication to a software-based RAID. In order to systematically study the effectiveness of this new method, we built two storage systems from the ground up: a POSIX-compliant distributed file system (FusionFS) and a distributed key-value store (IStore), both supporting information dispersal algorithms (IDA) for data redundancy. FusionFS is crafted to have excellent throughput and scalability for HPC, whereas IStore is architected mainly as a light-weight key-value storage in cloud computing. We evaluated both systems with a large number of parameter combinations. Results show that, for both HPC and cloud computing communities, IDA-based methods with current commodity hardware could outperform data replication in some cases, and would completely surpass data replication with the growing computational capacity through multi/many-core processors (e.g. Intel Xeon Phi, NVIDIA GPU).
Keywords
cloud computing; multiprocessing systems; parallel processing; redundancy; storage management; FusionFS system; HPC; IDA; IStore; POSIX-compliant distributed file system; cloud computing; data replication; distributed key-value store; distributed storage systems; high performance computing; information dispersal algorithms; lightweight key-value storage; many-core processors; multicore processors; redundant array of independent disks; software-based RAID; Artificial neural networks; Encoding; Nickel; Redundancy; Switches; Writing;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing (CLUSTER), 2013 IEEE International Conference on
Conference_Location
Indianapolis, IN
Type
conf
DOI
10.1109/CLUSTER.2013.6702655
Filename
6702655
Link To Document