Title :
Towards high-performance and cost-effective distributed storage systems with information dispersal algorithms
Author :
Dongfang Zhao ; Burlingame, Kent ; Debains, Corentin ; Alvarez-Tabio, Pedro ; Raicu, Ioan
Author_Institution :
Illinois Inst. of Technol., Chicago, IL, USA
Abstract :
Reliability is one of the most fundamental challenges for high performance computing (HPC) and cloud computing. Data replication is the de facto mechanism to achieve high reliability, even though it has been criticized for its high cost and low efficiency. Recent research showed promising results by switching the traditional data replication to a software-based RAID. In order to systematically study the effectiveness of this new method, we built two storage systems from the ground up: a POSIX-compliant distributed file system (FusionFS) and a distributed key-value store (IStore), both supporting information dispersal algorithms (IDA) for data redundancy. FusionFS is crafted to have excellent throughput and scalability for HPC, whereas IStore is architected mainly as a light-weight key-value storage in cloud computing. We evaluated both systems with a large number of parameter combinations. Results show that, for both HPC and cloud computing communities, IDA-based methods with current commodity hardware could outperform data replication in some cases, and would completely surpass data replication with the growing computational capacity through multi/many-core processors (e.g. Intel Xeon Phi, NVIDIA GPU).
Keywords :
cloud computing; multiprocessing systems; parallel processing; redundancy; storage management; FusionFS system; HPC; IDA; IStore; POSIX-compliant distributed file system; cloud computing; data replication; distributed key-value store; distributed storage systems; high performance computing; information dispersal algorithms; lightweight key-value storage; many-core processors; multicore processors; redundant array of independent disks; software-based RAID; Artificial neural networks; Encoding; Nickel; Redundancy; Switches; Writing;
Conference_Titel :
Cluster Computing (CLUSTER), 2013 IEEE International Conference on
Conference_Location :
Indianapolis, IN
DOI :
10.1109/CLUSTER.2013.6702655