• DocumentCode
    668152
  • Title

    Towards high-performance and cost-effective distributed storage systems with information dispersal algorithms

  • Author

    Dongfang Zhao ; Burlingame, Kent ; Debains, Corentin ; Alvarez-Tabio, Pedro ; Raicu, Ioan

  • Author_Institution
    Illinois Inst. of Technol., Chicago, IL, USA
  • fYear
    2013
  • fDate
    23-27 Sept. 2013
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Reliability is one of the most fundamental challenges for high performance computing (HPC) and cloud computing. Data replication is the de facto mechanism to achieve high reliability, even though it has been criticized for its high cost and low efficiency. Recent research showed promising results by switching the traditional data replication to a software-based RAID. In order to systematically study the effectiveness of this new method, we built two storage systems from the ground up: a POSIX-compliant distributed file system (FusionFS) and a distributed key-value store (IStore), both supporting information dispersal algorithms (IDA) for data redundancy. FusionFS is crafted to have excellent throughput and scalability for HPC, whereas IStore is architected mainly as a light-weight key-value storage in cloud computing. We evaluated both systems with a large number of parameter combinations. Results show that, for both HPC and cloud computing communities, IDA-based methods with current commodity hardware could outperform data replication in some cases, and would completely surpass data replication with the growing computational capacity through multi/many-core processors (e.g. Intel Xeon Phi, NVIDIA GPU).
  • Keywords
    cloud computing; multiprocessing systems; parallel processing; redundancy; storage management; FusionFS system; HPC; IDA; IStore; POSIX-compliant distributed file system; cloud computing; data replication; distributed key-value store; distributed storage systems; high performance computing; information dispersal algorithms; lightweight key-value storage; many-core processors; multicore processors; redundant array of independent disks; software-based RAID; Artificial neural networks; Encoding; Nickel; Redundancy; Switches; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing (CLUSTER), 2013 IEEE International Conference on
  • Conference_Location
    Indianapolis, IN
  • Type

    conf

  • DOI
    10.1109/CLUSTER.2013.6702655
  • Filename
    6702655