• DocumentCode
    3138630
  • Title

    Reliability-aware deduplication storage: Assuring chunk reliability and chunk loss severity

  • Author

    Nam, Youngjin ; Lu, Guanlin ; Du, David H C

  • Author_Institution
    Sch. of Comput. & Inf. Technol., Daegu Univ., Gyeongsan, South Korea
  • fYear
    2011
  • fDate
    25-28 July 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Reliability in deduplication storage has not attracted much research attention yet. To provide a demanded reliability for an incoming data stream, most deduplication storage systems first carry out deduplication process by eliminating duplicates from the data stream and then apply erasure coding for the remaining (unique) chunks. A unique chunk may be shared (i.e., duplicated) at many places of the data stream and shared by other data streams. That is why deduplication can reduce the required storage capacity. However, this occasionally becomes problematic to assure certain reliability levels required from different data streams. We introduce two reliability parameters for deduplication storage: chunk reliability and chunk loss severity. The chunk reliability means each chunk´s tolerance level in the face of any failures. The chunk loss severity represents an expected damage level in the event of a chunk loss, formally defined as the multiplication of actual damage by the probability of a chunk loss. We propose a reliability-aware deduplication solution that not only assures all demanded chunk reliability levels by making already existing chunks sharable only if its reliability is high enough, but also mitigates the chunk loss severity by adaptively reducing the probability of having a chunk loss. In addition, we provide future research directions following to the current study.
  • Keywords
    data handling; reliability; storage management; chunk loss severity; chunk reliability; chunk reliability level; data stream; demanded reliability; erasure coding; incoming data stream; reliability aware deduplication storage; Containers; Electronic mail; Encoding; Home appliances; Indexes; Performance analysis; Reliability; deduplication; loss severity; reliability; storage;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Green Computing Conference and Workshops (IGCC), 2011 International
  • Conference_Location
    Orlando, FL
  • Print_ISBN
    978-1-4577-1222-7
  • Type

    conf

  • DOI
    10.1109/IGCC.2011.6008566
  • Filename
    6008566