مرکز منطقه ای اطلاع رساني علوم و فناوري - Exploiting Fine-Grained Data Parallelism with Chip Multiprocessors and Fast Barriers

DocumentCode :

2977718

Title :

Exploiting Fine-Grained Data Parallelism with Chip Multiprocessors and Fast Barriers

Author :

Sampson, Jack ; González, Rubén ; Collard, Jean-Francois ; Jouppi, Norman P. ; Schlansker, Mike ; Calder, Brad

Author_Institution :

Dept of Comput. Sci. & Eng., California Univ., San Diego, CA

fYear :

2006

fDate :

Dec. 2006

Firstpage :

235

Lastpage :

246

Abstract :

We examine the ability of CMPs, due to their lower on-chip communication latencies, to exploit data parallelism at inner-loop granularities similar to that commonly targeted by vector machines. Parallelizing code in this manner leads to a high frequency of barriers, and we explore the impact of different barrier mechanisms upon the efficiency of this approach. To further exploit the potential of CMPs for fine-grained data parallel tasks, we present barrier filters, a mechanism for fast barrier synchronization on-chip multi-processors to enable vector computations to be efficiently distributed across the cores of a CMP. We ensure that all threads arriving at a barrier require an unavailable cache line to proceed, and, by placing additional hardware in the shared portions of the memory subsystem, we starve their requests until they all have arrived. Specifically, our approach uses invalidation requests to both make cache lines unavailable and identify when a thread has reached the barrier. We examine two types of barrier filters, one synchronizing through instruction cache lines, and the other through data cache lines

Keywords :

cache storage; microprocessor chips; synchronisation; vector processor systems; barrier filter; barrier synchronization; chip multiprocessor; data cache line; fine-grained data parallelism; instruction cache line; memory subsystem; vector machine; Concurrent computing; Delay; Distributed computing; Filters; Frequency synchronization; Laboratories; Lifting equipment; Parallel processing; Pipelines; Yarn;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Microarchitecture, 2006. MICRO-39. 39th Annual IEEE/ACM International Symposium on

Conference_Location :

Orlando, FL

ISSN :

1072-4451

Print_ISBN :

0-7695-2732-9

Type :

conf

DOI :

10.1109/MICRO.2006.23

Filename :

4041850

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2977718