DocumentCode :
2504933
Title :
Using overdecomposition to overlap communication latencies with computation and take advantage of SMT processors
Author :
Bongo, Lars Ailo ; Vinter, Brian ; Anshus, Otto J. ; Larsen, Tore ; Bjørndalen, John Markus
Author_Institution :
Dept. of Comput. Sci., Tromso Univ.
fYear :
0
fDate :
0-0 0
Lastpage :
247
Abstract :
Parallel programs running on clusters are typically decomposed and mapped to run with one thread per processor each working on its disjoint subset of the data. We evaluate performance improvements and limitations for a micro-benchmark and the NAS benchmarks, by using overdecomposition to map multiple threads to each processor to overlap computation with communication. The experiment platform is a cluster with Pentium 4 symmetric multithreading (SMT) processor nodes interconnected through gigabit Ethernet. Micro-benchmark results demonstrate execution time improvements up to 1.8. However, for the NAS benchmarks overdecomposition and SMT provides only slight performance gains, and sometimes significant performance loss. We evaluated improvement and limitation sensitivity to problem size, communication structure and whether SMT is enabled or not. We found that performance improvements are limited by applications having communication dependencies that limit thread-level parallelism, increase in cache misses, or increased systems activity. Our study contributes a better understanding of these limitations
Keywords :
local area networks; multi-threading; multiprocessor interconnection networks; NAS benchmark overdecomposition; SMT processors; communication structure; gigabit Ethernet; microbenchmark; multiple threads; parallel programs; symmetric multithreading processor; thread-level parallelism; Application software; Costs; Delay; Ethernet networks; Kernel; Linux; Multithreading; Parallel processing; Surface-mount technology; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing Workshops, 2006. ICPP 2006 Workshops. 2006 International Conference on
Conference_Location :
Columbus, OH
ISSN :
1530-2016
Print_ISBN :
0-7695-2637-3
Type :
conf
DOI :
10.1109/ICPPW.2006.77
Filename :
1690707
Link To Document :
بازگشت