• DocumentCode
    26988
  • Title
    Proactive Data Migration for Improved Storage Availability in Large-Scale Data Centers
  • Author
    Suzhen Wu; Hong Jiang; Bo Mao
  • Author_Institution
    Comput. Sci. Dept., Xiamen Univ., Xiamen, China
  • Volume
    64
  • Issue
    9
  • fYear
    2015
  • fDate
    Sept. 1, 2015
  • Firstpage
    2637
  • Lastpage
    2651
  • Abstract
    In the face of high partial and complete disk failure rates and untimely system crashes, low-priority background tasks are executed increasingly often in large-scale data centers. However, the existing algorithms are all reactive optimizations that exploit only the temporal locality of workloads to reduce user I/O requests during the low-priority background tasks. To address this problem, this paper proposes Intelligent Data Outsourcing (IDO), a zone-based, proactive data migration optimization that significantly improves the efficiency of the low-priority background tasks. The main idea of IDO is to proactively identify the hot data zones of RAID-structured storage systems in the normal operational state. By leveraging prediction tools to identify upcoming events, IDO proactively migrates the data blocks belonging to the hot data zones on the degraded device to a surrogate RAID set in the large-scale data center. Upon a disk failure or crash reboot, most user I/O requests addressed to the degraded RAID set can be serviced directly by the surrogate RAID set rather than by the much slower degraded RAID set. Consequently, the performance of the background tasks and the user I/O performance during the background tasks are improved simultaneously. Our lightweight prototype implementation of IDO and extensive trace-driven experiments on two case studies demonstrate that, compared with the existing state-of-the-art approaches, IDO effectively improves the performance of the low-priority background tasks. Moreover, IDO is portable and can be easily incorporated into any existing algorithm for RAID-structured storage systems.
  • Keywords
    computer centres; input-output programs; knowledge based systems; large-scale systems; optimisation; outsourcing; storage management; I/O requests; IDO; RAID set; crash reboot; disk failure rates; intelligent data outsourcing; large-scale data centers; low-priority background tasks; proactive data migration; reactive optimizations; storage availability; temporal locality; Availability; Computer crashes; Maintenance engineering; Optimization; Performance evaluation; Software; System performance; Proactive; RAID reconstruction; Temporal and spatial locality
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0018-9340
  • Type
    jour
  • DOI
    10.1109/TC.2014.2366734
  • Filename
    6945889