• DocumentCode
    2177795
  • Title

    Building Fast, Dense, Low-Power Caches Using Erasure-Based Inline Multi-bit ECC

  • Author

    Jangwoo Kim ; Hyunggyun Yang ; Mccartney, Mark P. ; Bhargava, Mudit ; Ken Mai ; Falsafi, Babak

  • Author_Institution
    High Performance Comput. Lab. (HPC), POSTECH, Pohang, South Korea
  • fYear
    2013
  • fDate
    2-4 Dec. 2013
  • Firstpage
    98
  • Lastpage
    107
  • Abstract
    The embedded memory hierarchy of microprocessors and systems-on-a-chip plays a critical role in the overall system performance, area, power, resilience, and yield. However, as process technologies scale down to nanometer-regime geometries, the design and implementation of the embedded memory system are becoming increasingly difficult due to a number of exacerbating factors including increasing process variability, manufacturing defects, device wear out, and susceptibility to energetic particle strikes. Consequently, conventional memory resilience techniques will be unable to counter the raw bit error rate of the memory arrays in future technologies at economically feasible design points. Error correcting codes (ECC) are a widely-used and effective technique for correcting memory errors, but using conventional ECC techniques to correct more than one bit per word incurs high latency, area, and power overheads. In this work, we propose a novel ECC scheme based on erasure coding that can extend ECC to correct and detect multiple erroneous bits at low latency, area, and power overheads. Our results show that the increased memory resilience afforded by erasure-based ECC (EB-ECC) can be traded off to boost the memory performance, area, power, and yield. We show that EB-ECC, when combined with less than 5% row redundancy, can improve the cache access latency, power, and stability by over 40% on average, while maintaining near 100% yield and runtime reliability.
  • Keywords
    cache storage; error correction codes; multiprocessing systems; system-on-chip; embedded memory hierarchy; embedded memory system; erasure coding; erasure-based inline multibit ECC; error correcting codes; memory resilience techniques; microprocessors; novel ECC scheme; raw bit error rate; systems-on-a-chip; Decoding; Encoding; Error correction codes; Memory management; Redundancy; Resilience; Runtime; ECC; modeling; resilience; yield;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable Computing (PRDC), 2013 IEEE 19th Pacific Rim International Symposium on
  • Conference_Location
    Vancouver, BC
  • Type

    conf

  • DOI
    10.1109/PRDC.2013.19
  • Filename
    6820845