Title :
Temperature aware SoC design for reduced leakage power and enhanced reliability
Author :
Abadir, M.S. ; Khouri, K. ; Gupta, Arpan
Author_Institution :
Freescale Semicond. Inc., Austin, TX
Abstract :
Summary form only given. Improvements in the speed and the size of chips due to process scaling come with the penalties of increased power densities and higher operating temperatures. Higher temperature not only poses a risk of thermal runaway of the chips but also increases leakage-power dissipation and reduces interconnect lifetime. Furthermore, the floorplan of a System-on-a-Chip (SoC) has a significant impact on the temperature profile and leakage power, requiring early analysis of the SoC temperature profiles. We will talk about a system level leakage estimation methodology that considers the floorplan of the SoC and also the positive feedback relation between temperature and leakage power. We also talk about a system-level Leakage-Aware Floorplanner (LEAF) that optimizes floorplans for temperature-aware leakage power along with the traditional metric of area. We observed up to 190% difference in the leakage power between leakage-unaware and leakage-aware floorplanning. Temperature also has a serious effect on the Mean Time to Failure (MTF) of interconnects because of electromigration. To address this issue, we have developed a Temperature Aware Global Router (TAGORE) that reduces the probability of chip failure due to interconnect failures. TAGORE achieves a reduction in the probability of failure by routing more wires in the colder regions of the chip and fewer wires in the hotter regions of the chip. We observed that TAGORE reduced the number of wires in the hottest region of a chip by up to 19.95% and by an average of 12.29%. This resulted in a decrease in the failure rate by up to 292 failures per million hours of operation. This approach to reliability improvement does not require any addition of redundant wires or vias. We also have an analytical examination of the reduction in the probability of interconnect failure and the failure rate due to temperature aware global routing.
Keywords :
electromigration; failure analysis; integrated circuit design; integrated circuit interconnections; integrated circuit reliability; network routing; probability; system-on-chip; TAGORE; chip failure probability; electromigration; interconnect failures; interconnect lifetime; leakage-power dissipation; operating temperatures; power densities; process scaling; reliability improvement; system level leakage estimation methodology; system-level leakage-aware floorplanner; system-on-a-chip; temperature aware SoC design; temperature aware global router; temperature profile; temperature-aware leakage power; thermal runaway; Electromigration; Failure analysis; Feedback; Power system interconnection; Routing; Semiconductor device reliability; System-on-a-chip; Temperature distribution; Wires;
Conference_Titel :
VLSI Design, Automation and Test, 2008. VLSI-DAT 2008. IEEE International Symposium on
Conference_Location :
Hsinchu
Print_ISBN :
978-1-4244-1616-5
Electronic_ISBN :
978-1-4244-1617-2
DOI :
10.1109/VDAT.2008.4542398