Title :
Dynamic Transfer of Computation to Processor Cache for Yield and Reliability Improvement
Author :
Paul, Somnath ; Bhunia, Swarup
Author_Institution :
Dept. of EECS, Case Western Reserve Univ., Cleveland, OH, USA
Abstract :
VLSI systems in the nanometer regime suffer from high defect rates and large parametric variations that lead to yield loss as well as reduced reliability of operation. An architectural framework that ensures proper system operation when few functional units are defective or unreliable under process-induced or temporal parametric variations can be effective in improving manufacturing yield and overall system reliability. In this paper, we propose a novel memory-based computational framework that exploits the on-chip memory to perform computation on demand using a lookup table (LUT)-based approach. The framework achieves reliable operation by transferring activity to embedded memory of a processor from a defective or unreliable functional unit. This allows the die to run at a reduced (but acceptable) performance level instead of being completely discarded due to unit failure (in case of defective functional unit) or being throttled (in case of temporal parameter variations, e.g., temperature induced variations). We note that although the worst-case latency of memory based computation can be considerably higher than regular operation latency, the average latency is only modestly higher due to the abundance of narrow-width operands. Furthermore, the operands for a specific instruction (e.g., integer add, multiply, or floating point add) experience high locality of reference and thus require loading only part of the LUTs in the cache. Simulation results for a set of benchmark applications show that the proposed scheme can significantly improve yield and reliability at the cost of only a small loss in performance (on an average 0.8%) and 10 × less area overhead compared to hardware duplication based defect tolerance approach.
Keywords :
VLSI; microprocessor chips; reliability; VLSI system; architectural framework; dynamic transfer; hardware duplication based defect tolerance; lookup table; manufacturing yield; memory based computation; memory-based computational framework; nanometer regime; narrow-width operands; on-chip memory; overall system reliability; processor cache; proper system operation; temporal parametric variation; unreliable functional unit; worst-case latency; yield and reliability improvement; CMOS technology; Computational modeling; Costs; Logic; Manufacturing processes; Propagation delay; Reliability; Table lookup; Temperature; Thermal management; Activity transfer; memory-based computation; reliability; thermal management; yield;
Journal_Title :
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
DOI :
10.1109/TVLSI.2010.2049389