• DocumentCode
    2149778
  • Title
    Exploiting replicated checkpoints for soft error detection and correction
  • Author
    Koc, Fahrettin ; Bozdas, Kenan ; Karsli, Burak ; Ergin, Oguz
  • Author_Institution
    Department of Computer Engineering, TOBB University of Economics and Technology, Ankara, Turkey
  • fYear
    2013
  • fDate
    18-22 March 2013
  • Firstpage
    1494
  • Lastpage
    1497
  • Abstract
    Register renaming is a widely used technique for removing false dependencies in contemporary superscalar microprocessors. A register alias table (RAT) holds the current locations of the values that correspond to the architectural registers. Some recently designed processors take a copy of the rename table at each branch instruction in order to recover its contents when a misspeculation occurs. In this paper, we first investigate the vulnerability of the RAT to transient errors. We then analyze the vulnerability of RAT checkpoints and propose two techniques for soft error detection and correction that utilize redundantly taken copies of entries whose content is the same as in the previous and/or next checkpoints. Simulation results for the SPEC 2006 benchmarks reveal that, on average, RAT vulnerability is 25% and checkpoint vulnerability is 6%. The results also reveal that redundancy exists in sequential checkpoint copies and can be used for error detection and correction. We propose techniques that exploit this redundancy and show that faults in 41% of all checkpoints and 44% of rolled-back checkpoints can be detected, and that errors in 33% of the rolled-back checkpoints can be corrected. Since we exploit already available storage, the proposed error detection and correction techniques can be implemented with minimal hardware overhead.
  • Keywords
    Arrays; Benchmark testing; Hardware; Microprocessors; Random access memory; Redundancy; Registers; Checkpoint; Error Detection and Correction; Microprocessors; RAT Vulnerability; Register Rename; Soft Error;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Design, Automation & Test in Europe Conference & Exhibition (DATE), 2013
  • Conference_Location
    Grenoble, France
  • ISSN
    1530-1591
  • Print_ISBN
    978-1-4673-5071-6
  • Type
    conf
  • DOI
    10.7873/DATE.2013.304
  • Filename
    6513750