• DocumentCode
    2806032
  • Title

    A cache error propagation model

  • Author

    Somani, Arun K. ; Trivedi, Kishor S.

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Eng., Iowa State Univ., Ames, IA, USA
  • fYear
    1997
  • fDate
    15-16 Dec 1997
  • Firstpage
    15
  • Lastpage
    21
  • Abstract
    Cache memory is a small, fast, memory system that holds frequently used data. With increasing processor speed, aggressive design practices increase the probability of fault occurrence and the presence of latent errors as the processor allows a short duration for read and write. The fault may corrupt the cache memory system or lead to an erroneous internal CPU state. The authors investigate error propagation in the cache memory system due to transient faults either in the cache memory itself or in the processor´s registers or both. The information gained from such an investigation should lead to the development of more effective error recovery mechanisms against failures due to transient faults arising in the machine´s cache memory and register set. They establish that even though the computer system is capable of recovering about 50% of the time from the effect of a single erroneous cache location/processor register, the other 50% of the time error recovery is affected only through specific recovery mechanisms. Their results are obtained using both a discrete-time Markov model and by means of error injection on a real system
  • Keywords
    cache storage; fault tolerant computing; system recovery; cache error propagation model; cache memory; discrete-time Markov model; erroneous internal CPU state; error injection; error propagation; error recovery mechanisms; fault occurrence; frequently used data; latent errors; processor registers; register set; time error recovery; transient faults; Cache memory; Computer applications; Computer errors; Control systems; NASA; Personal communication networks; Registers; Size control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fault-Tolerant Systems, 1997. Proceedings., Pacific Rim International Symposium on
  • Conference_Location
    Taipei
  • Print_ISBN
    0-8186-8212-4
  • Type

    conf

  • DOI
    10.1109/PRFTS.1997.640119
  • Filename
    640119