DocumentCode
54527
Title
Data Remapping for Static NUCA in Degradable Chip Multiprocessors
Author
Ying Wang ; Lei Zhang ; Yin-He Han ; Hua-Wei Li ; Xiaowei Li
Author_Institution
State Key Lab. of Comput. Archit., Inst. of Comput. Technol., Beijing, China
Volume
23
Issue
5
fYear
2015
fDate
May-15
Firstpage
879
Lastpage
892
Abstract
In chip multiprocessors (CMPs), nonuniform cache architecture (NUCA) is often employed to organize last-level cache (LLC) banks through network-on-chip (NoC). Because of the shrinking feature size and unstable operating environment, severe reliability problems unavoidably emerge and cause frequent on-chip component (e.g., cores, cache banks, routers) failures. Typical fault-tolerant CMPs should possess the feature of graceful degradation and function normally with deactivated tiles. However, for CMPs adopting static NUCA, certain physical address areas will become inaccessible when cache banks in a CMP node are isolated from the system. To protect cache from such threats induced by either online or offline faults, we survey several potential solutions and propose the utility-driven node remapping technique by reusing the resources in NoC. In our NoC-assisted remapping scheme, cache accesses to isolated banks are so redirected that cache space contention are successfully balanced and relieved in shared-LLC, thus ensuring the least performance penalty caused by fault isolation. Our experimental results show significant performance improvement over conventional resizing approaches such as set reduction.
Keywords
cache storage; fault diagnosis; fault tolerance; microprocessor chips; network-on-chip; LLC banks; NoC; cache banks; cache space contention; data remapping; degradable chip multiprocessors; fault isolation; fault-tolerant CMP; last-level cache; network-on-chip; nonuniform cache architecture; static NUCA; utility-driven node remapping technique; Circuit faults; Decoding; Fault tolerance; Fault tolerant systems; System-on-chip; Chip multiprocessor (CMP); fault tolerant; network-on-chip (NoC); nonuniform cache architecture (NUCA); nonuniform cache architecture (NUCA).;
fLanguage
English
Journal_Title
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
Publisher
ieee
ISSN
1063-8210
Type
jour
DOI
10.1109/TVLSI.2014.2325933
Filename
6835194
Link To Document