• DocumentCode
    3539802
  • Title

    Efficient scrub mechanisms for error-prone emerging memories

  • Author

    Awasthi, Manu ; Shevgoor, Manjunath ; Sudan, Kshitij ; Rajendran, Bipin ; Balasubramonian, Rajeev ; Srinivasan, Viji

  • fYear
    2012
  • fDate
    25-29 Feb. 2012
  • Firstpage
    1
  • Lastpage
    12
  • Abstract
    Many memory cell technologies are being considered as possible replacements for DRAM and Flash technologies, both of which are nearing their scaling limits. While these new cells (PCM, STT-RAM, FeRAM, etc.) promise high density, better scaling, and non-volatility, they introduce new challenges. Solutions at the architecture level can help address some of these problems; e.g., prior research has proposed wear-leveling and hard error tolerance mechanisms to overcome the limited write endurance of PCM cells. In this paper, we focus on the soft error problem in PCM, a topic that has received little attention in the architecture community. Soft errors in DRAM memories are typically addressed by having SECDED support and a scrub mechanism. The scrub mechanism scans the memory looking for a single-bit error and corrects it before the line experiences a second uncorrectable error. However, PCM (and other emerging memories) are prone to new sources of soft errors. In particular, multi-level cell (MLC) PCM devices will suffer from resistance drift, that increases the soft error rate and incurs high overheads for the scrub mechanism. This paper is the first to study the design of architectural scrub mechanisms, especially when tailored to the drift phenomenon in MLC PCM. Many of our solutions will also apply to other soft-error prone emerging memories. We first show that scrub overheads can be reduced with support for strong ECC codes and a lightweight error detection operation. We then design different scrub algorithms that can adaptively trade-off soft and hard errors. Using an approach that combines all proposed solutions, our scrub mechanism yields a 96.5% reduction in uncorrectable errors, a 24.4 × decrease in scrub-related writes, and a 37.8% reduction in scrub energy, relative to a basic scrub algorithm used in modern DRAM systems.
  • Keywords
    DRAM chips; error correction codes; fault tolerant computing; flash memories; memory architecture; DRAM memories; DRAM technologies; ECC codes; FeRAM; PCM cells; SECDED support; STT-RAM; architectural scrub mechanisms; architecture level; error detection operation; flash technologies; hard error tolerance mechanism; memory cell technologies; multilevel cell PCM devices; resistance drift; scaling limits; second uncorrectable error; single-bit error; soft error problem; soft error rate; soft-error prone emerging memories; wear-leveling mechanism; Computer architecture; Error analysis; Microprocessors; Phase change materials; Random access memory; Resistance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computer Architecture (HPCA), 2012 IEEE 18th International Symposium on
  • Conference_Location
    New Orleans, LA
  • ISSN
    1530-0897
  • Print_ISBN
    978-1-4673-0827-4
  • Electronic_ISBN
    1530-0897
  • Type

    conf

  • DOI
    10.1109/HPCA.2012.6168941
  • Filename
    6168941