DocumentCode :
262763
Title :
Efficient RAS support for die-stacked DRAM
Author :
Jeon, Hyeran ; Loh, Gabriel H. ; Annavaram, Murali
Author_Institution :
Ming Hsieh Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
fYear :
2014
fDate :
20-23 Oct. 2014
Firstpage :
1
Lastpage :
10
Abstract :
Die-stacked (3D) DRAM is a promising memory architecture to satisfy the high bandwidth and low latency needs of many computing systems. However, with technology scaling, memory devices are expected to experience significant increase in single and multi-bit errors. 3D DRAM will have the added burden of protecting against single through-silicon-via (TSV) failures, which translate into multiple bit errors in a single cache line, as well as multiple TSV failures that may lead to an entire channel failure. To exploit the wide interface capability of 3D DRAM, large chunks of data are laid out contiguously in a single channel and an entire cache line is sourced from a single channel. Conventional approaches such as ECC DIMM and chipkill-correct are inefficient in the context of 3D DRAM because they spread data across multiple DRAM layers to protect against failures and also place restrictions on the number of memory layers that must be protected together. This paper proposes a two-level error correction technique while taking into account 3D DRAM´s unique organization. First, we propose a new symbol-based ECC layout to cover 3D DRAM specific errors as well as various well-known DRAM failure modes. Then, an XOR based correction code (XCC) is used to correct the multi-bit errors that are not correctable by the symbol based ECC. To take advantage of 3D DRAM´s channel level parallelism, a permutation-based ECC placement is used. As an optimization, an ECC cache and decoupled XCC update are used for improving read and write performance without compromising reliability. The proposed approaches effectively reduce the FIT rate with almost negligible performance overhead.
Keywords :
DRAM chips; cache storage; error correction codes; logic design; three-dimensional integrated circuits; 3D DRAM; DRAM failure modes; ECC DIMM; ECC cache; RAS support; XOR based correction code; cache line; channel failure; channel level parallelism; computing systems; decoupled XCC update; die-stacked DRAM; memory architecture; memory devices; memory layers; multibit errors; multiple TSV failures; multiple bit errors; permutation-based ECC placement; read performance; symbol-based ECC layout; technology scaling; through-silicon-via failures; two-level error correction technique; wide interface capability; write performance; DRAM chips; Error correction codes; Layout; Organizations; Three-dimensional displays; Through-silicon vias;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Test Conference (ITC), 2014 IEEE International
Conference_Location :
Seattle, WA
Type :
conf
DOI :
10.1109/TEST.2014.7035318
Filename :
7035318
Link To Document :
بازگشت