DocumentCode :
2283316
Title :
Error Detection via Online Checking of Cache Coherence with Token Coherence Signatures
Author :
Meixner, Albert ; Sorin, Daniel J.
Author_Institution :
Dept. of Comput. Sci., Duke Univ., Durham, NC
fYear :
2007
fDate :
10-14 Feb. 2007
Firstpage :
145
Lastpage :
156
Abstract :
To provide high dependability in a multithreaded system despite hardware faults, the system must detect and correct errors in its shared memory system. Recent research has explored dynamic checking of cache coherence as a comprehensive approach to memory system error detection. However, existing coherence checkers are costly to implement, incur high interconnection network traffic overhead, and do not scale well. In this paper, we describe the token coherence signature checker (TCSC), which provides comprehensive, low-cost, scalable coherence checking by maintaining signatures that represent recent histories of coherence events at all nodes (cache and memory controllers). Periodically, these signatures are sent to a verifier to determine if an error occurred. TCSC has a small constant hardware cost per node, independent of cache and memory size and the number of nodes. TCSC´s interconnect bandwidth overhead has a constant upper bound and never exceeds 7% in our experiments. TCSC has negligible impact on system performance
Keywords :
cache storage; multi-threading; protocols; shared memory systems; cache coherence; cache controller; coherence events; error detection; interconnect bandwidth overhead; memory controller; multithreaded system; online checking; shared memory system; token coherence signature checker; Bandwidth; Communication system traffic control; Costs; Error correction; Fault detection; Hardware; History; Multiprocessor interconnection networks; Power capacitors; Thyristors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computer Architecture, 2007. HPCA 2007. IEEE 13th International Symposium on
Conference_Location :
Scottsdale, AZ
Print_ISBN :
1-4244-0805-9
Electronic_ISBN :
1-4244-0805-9
Type :
conf
DOI :
10.1109/HPCA.2007.346193
Filename :
4147656
Link To Document :
بازگشت