DocumentCode :
451152
Title :
Evaluating Titanium SPMD Programs on the Tera MTA
Author :
Miyamoto, Carleton ; Lin, Chang
Author_Institution :
University of California, Berkeley
fYear :
1999
fDate :
13-18 Nov. 1999
Firstpage :
43
Lastpage :
43
Abstract :
While the common trend in building large-scale multiprocessors is to use commodity compute nodes that are increasingly powerful and have deep memory hierarchies, the Tera MTA uses a different design point, with a relatively flat memory system, no processor caches, and hardware support for light-weight multithreading, which is used to mask memory latency. In this paper we explore the implementation of Titanium, a language with coarse-grained SPMD parallelism, onto the MTA. The major concerns in obtaining high performance on the MTA are sufficient degrees of parallelism, good load balance, and low synchronization overhead. We show that by adding loop level parallelism, Titanium applications have sufficient parallelism for the MTA, and as expected, application writers do not need to orchestrate data layout. We evaluate multiple implementations of the Titanium synchronization constructs, which include barriers and monitors. We then explore several scheduling strategies, and find that the distinction between SPMD and loop level parallelism proves to be surprisingly useful. The two-level parallelism structure can be used to throttle thread migration, which lowers thread creation overhead and synchronization. We use a combination of micro-benchmarks and applications to demonstrate these results.
Keywords :
Buildings; Computer science; Contracts; Delay; Hardware; Large-scale systems; Multithreading; Parallel processing; Titanium; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Supercomputing, ACM/IEEE 1999 Conference
Print_ISBN :
1-58113-091-0
Type :
conf
DOI :
10.1109/SC.1999.10032
Filename :
1592686
Link To Document :
بازگشت