DocumentCode :
1647192
Title :
Toward a cost-effective DSM organization that exploits processor-memory integration
Author :
Torrellas, Josep ; Yang, Liuxi ; Nguyen, Anthony-Trung
Author_Institution :
Illinois Univ., Urbana, IL, USA
fYear :
2000
fDate :
6/22/1905 12:00:00 AM
Firstpage :
15
Lastpage :
25
Abstract :
Dramatic increases in the number of transistors that can be integrated on a VLSI chip will soon allow commodity microprocessors to include both processor and a sizable fraction of main memory on chip. Distributed Shared-Memory (DSM) multiprocessors typically use the latest off-the-shelf microprocessors and thus will be affected by the upcoming processor-memory integration. In this paper, we explore how a cache-coherent DSM machine built around Processor-In-Memory (PIM) chips might be cost-effectively organized. To take advantage of the close coupling between processor and memory, we propose tagging the memory and organizing it as a cache. Furthermore, commercial considerations dictate the use of off-the-shelf hardware largely designed for uniprocessors. Consequently, we keep the directory control off-chip. To keep the multiprocessor cheap and simple, and to allow for reconfigurability, directory control is performed by chips that are identical to the ones used as compute nodes. As a result, the machine hardware can be easily reconfigured for computing or coherence-handling depending on the needs of the application. We also propose a cache coherence protocol that is tailored to our architecture: it uses the memory very efficiently while exploiting the large caching space available. Overall, the resulting machine is simple and inexpensive, and delivers performance that is comparable to, and higher than, the more expensive traditional COMA and CC-NUMA organizations, respectively
Keywords :
distributed shared memory systems; performance evaluation; protocols; reconfigurable architectures; CC-NUMA organizations; COMA; VLSI chip; cache coherence protocol; cache-coherent DSM machine; commodity microprocessors; distributed shared memory organization; machine hardware; multiprocessors; off-the-shelf hardware; processor-memory integration; reconfigurability; Computer architecture; Hardware; Microprocessors; Organizing; Protocols; Tagging; Very large scale integration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High-Performance Computer Architecture, 2000. HPCA-6. Proceedings. Sixth International Symposium on
Conference_Location :
Touluse
Print_ISBN :
0-7695-0550-3
Type :
conf
DOI :
10.1109/HPCA.2000.824335
Filename :
824335
Link To Document :
بازگشت