Title :
Performance evaluation and analysis of thread pinning strategies on multi-core platforms: Case study of SPEC OMP applications on Intel architectures
Author :
Mazouz, Abdelhafid ; Touati, Sid-Ahmed-Ali ; Barthou, Denis
Author_Institution :
Univ. of Versailles St. Quentin en Yvelines, Versailles, France
Abstract :
With the introduction of multi-core processors, thread affinity has quickly become one of the most important factors in reducing program execution times. This article presents a complete experimental study of the performance of various thread pinning strategies. We investigate four application-independent thread pinning strategies and five application-sensitive ones based on cache sharing. We carried out an extensive performance evaluation on three different multi-core machines reflecting three common usage profiles: a workstation, a server, and a high-performance machine. Overall, we show that fixing thread affinities (whatever the tested strategy) is a better choice for improving program performance on HPC ccNUMA machines than OS-based thread placement. This means that the current Linux OS scheduling strategy is not necessarily the best choice in terms of performance on ccNUMA machines, even if it is a good choice in terms of cores usage ratio and work balancing. On smaller Core2 and Nehalem machines, we show that thread pinning does not yield satisfactory speedups over OS-based scheduling, but performance stability is much better.
Keywords :
Linux; cache storage; microprocessor chips; multiprocessing systems; performance evaluation; scheduling; Core2 machines; HPC ccNUMA machine; Intel architectures; Linux OS scheduling strategy; Nehalem machines; OS based thread placement; SPEC OMP applications; cache sharing; cores usage ratio; high performance machine; multicore processors; performance evaluation; program execution times; server machine; thread affinity; thread pinning strategies; work balancing; workstation machine; Instruction sets; Kernel; Linux; Multicore processing; Random access memory; Sockets; Multi-Cores; OpenMP; Operating Systems; Thread Affinity; Thread Level Parallelism;
Conference_Title :
High Performance Computing and Simulation (HPCS), 2011 International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-61284-380-3
DOI :
10.1109/HPCSim.2011.5999834