• DocumentCode
    3172032
  • Title

    A hierarchical approach for dependability analysis of a commercial cache-based RAID storage architecture

  • Author

    Kaâniche, M. ; Romano, L. ; Kalbarczyk, Z. ; Iyer, R. ; Karcich, R.

  • Author_Institution
    Lab. d´´Autom. et d´´Anal. des Syst., CNRS, Toulouse, France
  • fYear
    1998
  • fDate
    23-25 June 1998
  • Firstpage
    6
  • Lastpage
    15
  • Abstract
    We present a hierarchical simulation approach for the dependability analysis and evaluation of a highly available commercial cache-based RAID storage system. The architecture is complex and includes several layers of overlapping error detection and recovery mechanisms. Three abstraction levels have been developed to model the cache architecture, cache operations, and error detection and recovery mechanism. The impact of faults and errors occurring in the cache and in the disks is analyzed at each level of the hierarchy. A simulation submodel is associated with each abstraction level. The models have been developed using DEPEND, a simulation-based environment for system-level dependability analysis, which provides facilities to inject faults into a functional behavior model, to simulate error detection and recovery mechanisms, and to evaluate quantitative measures. Several fault models are defined for each submodel to simulate cache component failures, disk failures, transmission errors, and data errors in the cache memory and in the disks. Some of the parameters characterizing fault injection in a given submodel correspond to probabilities evaluated from the simulation of the lower-level submodel. Based on the proposed methodology, we evaluate and analyze: the system behavior under a real workload and high error rate (focusing on error bursts); the coverage of the error detection mechanisms implemented in the system and the error latency distributions; and the accumulation of errors in the cache and in the disks.
  • Keywords
    cache storage; fault tolerant computing; magnetic disc storage; memory architecture; system recovery; virtual machines; DEPEND; RAID; cache storage; data errors; dependability analysis; disk failure; error detection; error latency distributions; error recovery; hierarchical simulation; storage architecture; system behavior; transmission error; Analytical models; Cache storage; Computational modeling; Computer architecture; Delay; Error analysis; Error correction; Fault detection; Fault tolerance; Performance analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fault-Tolerant Computing, 1998. Digest of Papers. Twenty-Eighth Annual International Symposium on
  • Conference_Location
    Munich, Germany
  • ISSN
    0731-3071
  • Print_ISBN
    0-8186-8470-4
  • Type

    conf

  • DOI
    10.1109/FTCS.1998.689450
  • Filename
    689450