Title :
Organizing the last line of defense before hitting the memory wall for CMPs
Author :
Liu, Chun ; Sivasubramaniam, Anand ; Kandemir, Mahmut
Author_Institution :
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
Abstract :
The last line of defense in the cache hierarchy before going to off-chip memory is critical in chip multiprocessors (CMPs) from both the performance and power perspectives. We investigate different organizations for this last line of defense (assumed to be L2 in this article) with the goal of reducing off-chip memory accesses. We evaluate the trade-offs between private L2 and address-interleaved shared L2 designs, noting their individual benefits and drawbacks. A possible imbalance between the L2 demands of the CPUs favors a shared L2 organization, while interference between these demands can favor a private L2 organization. We propose a new architecture, called Shared Processor-Based Split L2, that captures the benefits of these two organizations while avoiding many of their drawbacks. Using several applications from the SPEC OMP suite and a commercial benchmark, Specjbb, on a complete system simulator, we demonstrate the benefits of this shared processor-based L2 organization. Our results show as much as a 42.50% improvement in IPC over the private organization (11.52% on average), and as much as a 42.22% improvement over the shared interleaved organization (9.76% on average).
Keywords :
cache storage; memory architecture; microprocessor chips; CMP; CPU; IPC; SPEC OMP; Shared Processor-Based Split; Specjbb; address-interleaved shared design; cache hierarchy; chip multiprocessor; memory wall; off-chip memory; shared interleaved organization; system simulator; Buildings; Computer science; Costs; Interference; Organizing; Parallel processing; Program processors; Runtime; System-on-a-chip; Yarn;
Conference_Title :
Software, IEE Proceedings-
Print_ISBN :
0-7695-2053-7
DOI :
10.1109/HPCA.2004.10017