DocumentCode
11498
Title
Process Placement in Multicore Clusters: Algorithmic Issues and Practical Techniques
Author
Jeannot, Emmanuel ; Mercier, Guillaume ; Tessier, Francois
Author_Institution
INRIA Bordeaux Sud-Ouest, Talence, France
Volume
25
Issue
4
fYear
2014
fDate
Apr-14
Firstpage
993
Lastpage
1002
Abstract
Current generations of NUMA node clusters feature multicore or manycore processors. Programming such architectures efficiently is a challenge because numerous hardware characteristics have to be taken into account, especially the memory hierarchy. One appealing idea to improve the performance of parallel applications is to decrease their communication costs by matching the communication pattern to the underlying hardware architecture. In this paper, we detail the algorithm and techniques proposed to achieve such a result. First, we gather both the communication pattern information and the hardware details. Then, we compute a relevant reordering of the various process ranks of the application. Finally, those new ranks are used to reduce the communication costs of the application.
Keywords
multiprocessing systems; parallel programming; pattern matching; NUMA node clusters; algorithmic issues; application communication cost reduction; application process rank reordering; communication costs; communication pattern information; communication pattern matching; hardware architecture; hardware characteristics; manycore processor; memory hierarchy; multicore clusters; multicore processor; parallel applications; performance improvement; process placement; programming; Computational modeling; Hardware; Multicore processing; Network topology; Standards; Topology; Parallel programming; high performance computing; multicore processing
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
IEEE
ISSN
1045-9219
Type
jour
DOI
10.1109/TPDS.2013.104
Filename
6495451