DocumentCode :
3120050
Title :
A novel migration-based NUCA design for Chip Multiprocessors
Author :
Kandemir, Mahmut ; Li, Feihui ; Irwin, Mary Jane ; Son, Seung Woo
Author_Institution :
Pennsylvania State Univ., University Park, PA, USA
fYear :
2008
fDate :
15-21 Nov. 2008
Firstpage :
1
Lastpage :
12
Abstract :
Chip Multiprocessors (CMPs) and Non-Uniform Cache Architectures (NUCAs) represent two emerging trends in computer architecture. Targeting future CMP based systems with NUCA type L2 caches, this paper proposes a novel data migration algorithm for parallel applications and evaluates it. The goal of this migration scheme is to determine a suitable location for each data block within a large L2 space at any given point during execution. A unique characteristic of the proposed scheme is that it models the problem of optimal data placement in the L2 cache space as a two-dimensional post office placement problem, presents a practical architectural implementation of this model, and gives a detailed evaluation of the proposed implementation. In our experimental evaluation, we also compare our approach to a previously-proposed NUCA management scheme using applications from the specomp suite, oltp, specjbb, and specweb. These experiments show that our migration approach generates about 35% improvement, on average, in average L2 access latency over the previous migration scheme, and these L2 latency savings translate, on average, to 9.5% improvement in IPC (instructions per cycle).We also observed during our experiments that both the careful initial placement of data (which itself triggers migrations within the L2 space) and subsequent migrations (due to inter-processor data sharing) play an important role in achieving our performance improvements.
Keywords :
computer architecture; microprocessor chips; multiprocessing systems; 2D post office placement problem; CMP based system; L2 access latency; NUCA management scheme; chip multiprocessor; computer architecture; data block; data migration algorithm; migration-based NUCA design; non-uniform cache architecture; optimal data placement; Application software; Buildings; Clocks; Computer architecture; Delay; Energy consumption; Frequency; Parallel processing; Runtime; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis, 2008. SC 2008. International Conference for
Conference_Location :
Austin, TX
Print_ISBN :
978-1-4244-2834-2
Electronic_ISBN :
978-1-4244-2835-9
Type :
conf
DOI :
10.1109/SC.2008.5216918
Filename :
5216918
Link To Document :
بازگشت