DocumentCode
3712383
Title
A hardware-based multi-objective thread mapper for tiled manycore architectures
Author
Ravi Kumar Pujari;Thomas Wild;Andreas Herkersdorf
Author_Institution
Institute for Integrated Systems Technische Universitat M?nchen, Germany
fYear
2015
Firstpage
459
Lastpage
462
Abstract
Thread mapping is typically performed as an integral part of cooperative or pre-emptive operating system (OS) scheduling in order to share the processor core(s) among competing applications. Schedulers usually follow a single-objective performance optimization, such as maximizing core utilization or satisfying deadlines by the prioritization of threads. Meeting multiple orthogonal objectives, like performance vs. power or thermal resilience, in the era of manycore processors is a challenge because of the associated scalability and thread management overhead. We tackle these challenges by employing a two stage thread management strategy. In the first stage (not covered in this short paper), threads are assigned to regions or compute tiles. For the second stage we introduce in this paper the TCU (Thread Control Unit), a configurable, low latency, low overhead hardware thread mapper that takes various runtime sensor parameters into account. It can map threads within a small and bounded number of clock cycles in round robin, single or multi-objective manner. TCU is designed to consider not just load balancing or performance criteria but also physical constraints like power budgets, temperature limits and reliability aspects. TCU macro achieves 150K thread mappings per second on a tiled MPSoC FPGA prototype while operating at moderate 50 Mz. Evaluations of different mapping policies show that multi-objective thread mapping provides about 10 to 40% less mapping latency for periodic and bursty traffic compared to single-objective or round robin schemes. FPGA and ASIC syntheses reveal a 9% hardware overhead for the TCU on a four core compute tile.
Keywords
"Instruction sets","Temperature sensors","Message systems","Field programmable gate arrays","Computer architecture","Hardware","Round robin"
Publisher
ieee
Conference_Titel
Computer Design (ICCD), 2015 33rd IEEE International Conference on
Type
conf
DOI
10.1109/ICCD.2015.7357148
Filename
7357148
Link To Document