DocumentCode :
3129786
Title :
Transactional memory coherence and consistency
Author :
Hammond, Lance ; Wong, Vicky ; Chen, Mike ; Carlstrom, Brian D. ; Davis, John D. ; Hertzberg, Ben ; Prabhu, Manohar K. ; Wijaya, Honggo ; Kozyrakis, Christos ; Olukotun, Kunle
Author_Institution :
Stanford Univ., CA, USA
fYear :
2004
fDate :
19-23 June 2004
Firstpage :
102
Lastpage :
113
Abstract :
In this paper, we propose a new shared memory model: transactional memory coherence and consistency (TCC). TCC provides a model in which atomic transactions are always the basic unit of parallel work, communication, memory coherence, and memory reference consistency. TCC greatly simplifies parallel software by eliminating the need for synchronization using conventional locks and semaphores, along with their complexities. TCC hardware must combine all writes from each transaction region in a program into a single packet and broadcast this packet to the permanent shared memory state atomically as a large block. This simplifies the coherence hardware because it reduces the need for small, low-latency messages and completely eliminates the need for conventional snoopy cache coherence protocols, as multiple speculatively written versions of a cache line may safely coexist within the system. Meanwhile, automatic, hardware-controlled rollback of speculative transactions resolves any correctness violations that may occur when several processors attempt to read and write the same data simultaneously. The cost of this simplified scheme is higher interprocessor bandwidth. To explore the costs and benefits of TCC, we study the characteristics of an optimal transaction-based memory system, and examine how different design parameters could affect the performance of real systems. Across a spectrum of applications, the TCC model itself did not limit available parallelism. Most applications are easily divided into transactions requiring only small write buffers, on the order of 4-8 KB. The broadcast requirements of TCC are high, but are well within the capabilities of CMPs and small-scale SMPs with high-speed interconnects.
Keywords :
buffer storage; distributed shared memory systems; memory architecture; parallel architectures; transaction processing; CMP; atomic transactions; automatic hardware-controlled rollback; broadcast requirements; cache coherence protocols; correctness violations; data reading; data writing; high-speed interconnects; interprocessor bandwidth; low-latency messages; memory reference consistency; optimal transaction-based memory system; packet broadcasting; parallel communication; parallel software; parallel work; permanent shared memory; shared memory model; small-scale SMP; speculative transactions; transaction region; transactional memory coherence; transactional memory consistency; write buffers; Computer architecture;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Architecture, 2004. Proceedings. 31st Annual International Symposium on
ISSN :
1063-6897
Print_ISBN :
0-7695-2143-6
Type :
conf
DOI :
10.1109/ISCA.2004.1310767
Filename :
1310767
Link To Document :
بازگشت