Title :
Microarchitectural Online Testing for Failure Detection in Memory Order Buffers
Author :
Carretero, Javier ; Vera, Xavier ; Chaparro, Pedro ; Abella, Jaume
Author_Institution :
Intel Barcelona Res. Center, Intel Labs. Barcelona-UPC, Barcelona, Spain
fDate :
5/1/2010 12:00:00 AM
Abstract :
Technology scaling leads to burn-in phase out and higher postsilicon test complexity, which increases in-the-field failure rate due to both latent defects and actual errors, respectively. As a consequence, current reliability qualification methods will likely be infeasible. Microarchitecture knowledge of application runtime behavior offers a possibility to have low-cost continuous online testing techniques detect hard errors in the field. Whereas data can be protected with redundancy (like parity or ECC), there is a lack of mechanism for control logic. This paper proposes a microarchitectural approach for validating that the memory order buffer logic works correctly. Our design relies on a small cache-like structure that keeps track of the last store to each cached address. Each load is checked to have obtained the data from the youngest older producing store. We present three different implementations of this idea, offering different trade-offs for error coverage, performance overhead, and design complexity.
Keywords :
cache storage; failure analysis; logic design; logic testing; microprocessor chips; burn-in phase out; cache-like structure; cached address; design complexity; error coverage; failure detection; memory order buffer logic; microarchitectural online testing; postsilicon test complexity; reliability qualification method; Computer bugs; Costs; Error correction; Error correction codes; Fault detection; Hardware; Logic testing; Microarchitecture; Protection; Sprites (computer); System testing; Voltage; Online testing; control logic; defects.; error detection; memory order buffer; microarchitecture; soft errors;
Journal_Title :
Computers, IEEE Transactions on
DOI :
10.1109/TC.2009.139