DocumentCode :
2480155
Title :
Flash correct-and-refresh: Retention-aware error management for increased flash memory lifetime
Author :
Cai, Yu ; Yalcin, Gulay ; Mutlu, Onur ; Haratsch, Erich F. ; Cristal, Adrian ; Unsal, Osman S. ; Mai, Ken
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2012
fDate :
Sept. 30 2012-Oct. 3 2012
Firstpage :
94
Lastpage :
101
Abstract :
With the continued scaling of NAND flash and multi-level cell technology, flash-based storage has gained widespread use in systems ranging from mobile platforms to enterprise servers. However, the robustness of NAND flash cells is an increasing concern, especially at nanometer-regime process geometries. NAND flash memory bit error rate increases exponentially with the number of program/erase cycles. Stronger error correcting codes (ECC) can be used to tolerate higher error rates, but these have diminishing returns with increasing P/E cycles and can have prohibitively high power, area, and latency overheads. The goal of this paper is to develop new techniques that can tolerate high bit error rates without requiring prohibitively strong ECC. Our techniques, called Flash Correct-and-Refresh (FCR) exploit the observation that the dominant error source in NAND flash memory is retention errors, caused by flash cells losing charge over time. The key idea is to periodically read, correct, and reprogram (in-place) or remap the stored data before it accumulates more retention errors than can be corrected by simple ECC. Detailed simulations of a solid-state drive (SSD) storage system driven by measured experimental data from error characterization on real flash memory chips show that our techniques provide 46× average lifetime improvement on a variety of workloads at no additional hardware cost. We also find that our techniques achieve lifetime improvements that cannot feasibly be achieved with stronger ECC.
Keywords :
NAND circuits; error correction codes; error statistics; flash memories; multivalued logic circuits; ECC; FCR; NAND flash memory; P-E cycle; SSD storage system; bit error rate; dominant error source; enterprise server; error correcting codes; flash correct and refresh; flash memory chip; flash memory lifetime; flash-based storage; mobile platform; multilevel cell technology; nanometer regime process geometry; program-erase cycle; retention aware error management; retention error; solid-state drive; Bit error rate; Error correction codes; Flash memory; Nonvolatile memory; Programming; Threshold voltage; NAND Flash; error correction; multi-level cell (MLC); reliability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Design (ICCD), 2012 IEEE 30th International Conference on
Conference_Location :
Montreal, QC
ISSN :
1063-6404
Print_ISBN :
978-1-4673-3051-0
Type :
conf
DOI :
10.1109/ICCD.2012.6378623
Filename :
6378623
Link To Document :
بازگشت