• DocumentCode
    3000738
  • Title

    Analysis of Data Reliability Tradeoffs in Hybrid Distributed Storage Systems

  • Author

    Tang, Bing ; Fedak, Gilles

  • Author_Institution
    INRIA, LIP, Univ. of Lyon, Lyon, France
  • fYear
    2012
  • fDate
    21-25 May 2012
  • Firstpage
    1546
  • Lastpage
    1555
  • Abstract
    This paper surveys previous distributed storage systems and related data redundancy and fault-tolerance schemes which are introduced to overcome the impact of host churn on data reliability. Furthermore, a hybrid storage system model is proposed which offers a reliable data storage service by integrating idle storage contributed by volatile peer nodes and stable and durable storage utilities. In order to ensure high availability and durability for this hybrid storage system, we explore four reliability improvement strategies, including File Replica Strategy, File Encoding Strategy, Replica Repair Strategy, and Stable-Volatile Strategy, as well as the combination of these four strategies. Extensive simulations based on real traces are performed, in which data availability, data durability, and storage overhead are evaluated. Simulation results show that compared with previous peer-to-peer storage systems, the proposed hybrid storage system could achieve a higher availability and durability with less storage consumption, due to proposed new strategies. Finally, taking into account storage and traffic cost, the tradeoffs between storage efficiency and reliability are discussed.
  • Keywords
    fault tolerant computing; peer-to-peer computing; storage management; data availability; data durability; data redundancy; data reliability tradeoff; durable storage utilities; fault-tolerance scheme; file encoding strategy; file replica strategy; hybrid distributed storage system; hybrid storage system model; idle storage; peer-to-peer storage system; reliability improvement strategies; reliable data storage service; replica repair strategy; stable storage utilities; stable-volatile strategy; storage consumption; storage efficiency; storage overhead; storage reliability; traffic cost; volatile peer nodes; Availability; Data models; Maintenance engineering; Peer to peer computing; Redundancy; Synchronization; Availability; Durability; Hybrid Storage System; Peer-to-Peer; Reliability Tradeoff;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-0974-5
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2012.195
  • Filename
    6270826