DocumentCode
3129786
Title
Transactional memory coherence and consistency
Author
Hammond, Lance ; Wong, Vicky ; Chen, Mike ; Carlstrom, Brian D. ; Davis, John D. ; Hertzberg, Ben ; Prabhu, Manohar K. ; Wijaya, Honggo ; Kozyrakis, Christos ; Olukotun, Kunle
Author_Institution
Stanford Univ., CA, USA
fYear
2004
fDate
19-23 June 2004
Firstpage
102
Lastpage
113
Abstract
In this paper, we propose a new shared memory model: transactional memory coherence and consistency (TCC). TCC provides a model in which atomic transactions are always the basic unit of parallel work, communication, memory coherence, and memory reference consistency. TCC greatly simplifies parallel software by eliminating the need for synchronization using conventional locks and semaphores, along with their complexities. TCC hardware must combine all writes from each transaction region in a program into a single packet and broadcast this packet to the permanent shared memory state atomically as a large block. This simplifies the coherence hardware because it reduces the need for small, low-latency messages and completely eliminates the need for conventional snoopy cache coherence protocols, as multiple speculatively written versions of a cache line may safely coexist within the system. Meanwhile, automatic, hardware-controlled rollback of speculative transactions resolves any correctness violations that may occur when several processors attempt to read and write the same data simultaneously. The cost of this simplified scheme is higher interprocessor bandwidth. To explore the costs and benefits of TCC, we study the characteristics of an optimal transaction-based memory system, and examine how different design parameters could affect the performance of real systems. Across a spectrum of applications, the TCC model itself did not limit available parallelism. Most applications are easily divided into transactions requiring only small write buffers, on the order of 4-8 KB. The broadcast requirements of TCC are high, but are well within the capabilities of CMPs and small-scale SMPs with high-speed interconnects.
Keywords
buffer storage; distributed shared memory systems; memory architecture; parallel architectures; transaction processing; CMP; atomic transactions; automatic hardware-controlled rollback; broadcast requirements; cache coherence protocols; correctness violations; data reading; data writing; high-speed interconnects; interprocessor bandwidth; low-latency messages; memory reference consistency; optimal transaction-based memory system; packet broadcasting; parallel communication; parallel software; parallel work; permanent shared memory; shared memory model; small-scale SMP; speculative transactions; transaction region; transactional memory coherence; transactional memory consistency; write buffers; Computer architecture;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Architecture, 2004. Proceedings. 31st Annual International Symposium on
ISSN
1063-6897
Print_ISBN
0-7695-2143-6
Type
conf
DOI
10.1109/ISCA.2004.1310767
Filename
1310767
Link To Document