Title :
Dynamic Thread Scheduling in Asymmetric Multicores to Maximize Performance-per-Watt
Author :
Annamalai, Arunachalam ; Rodrigues, Rance ; Koren, Israel ; Kundu, Sandip
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Massachusetts at Amherst, Amherst, MA, USA
Abstract :
Recent trends in technology scaling have enabled the incorporation of multiple processor cores on a single die. Depending on the characteristics of the cores, the multicore may be either symmetric (SMP) or asymmetric (AMP). Several studies have shown that in general, for a given resource and power budget, AMPs are likely to outperform their SMP counterparts. However, due to the heterogeneity in AMPs, scheduling threads is always a challenge. To address the issue of thread scheduling in AMP, we propose a novel dynamic thread scheduling scheme that continuously monitors the current characteristics of the executing threads and determines the best thread to core assignment. The real-time monitoring is done using hardware performance counters that capture several microarchitecture independent characteristics of the threads in order to determine the thread to core affinity. By controlling thread scheduling in hardware, the Operating System (OS) need not be aware of the underlying microarchitecture, significantly simplifying the OS scheduler for an AMP architecture. The proposed scheme is compared against a simple Round Robin scheduling and a recently published dynamic thread scheduling technique that allows swapping of threads (between asymmetric cores) at coarse grain time intervals, once every context switch (~20 ms for the Linux scheduler). The presented results indicate that our proposed scheme is able to achieve, on average, a performance/watt benefit of 10.5% over the previously published dynamic scheduling scheme and about 12.9% over the Round Robin scheme.
Keywords :
multiprocessing systems; operating systems (computers); real-time systems; scheduling; Linux scheduler; OS scheduler; asymmetric multicores; core assignment; dynamic thread scheduling; microarchitecture; multiple processor cores; operating system; performance-per-watt maximization; realtime monitoring; round robin scheduling; technology scaling; Benchmark testing; Dynamic scheduling; Hardware; Instruction sets; Message systems; Monitoring; Multicore processing; Asymmetric Multicore Processor (AMP); Instructions per cycle (IPC); Thread swapping;
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0974-5
DOI :
10.1109/IPDPSW.2012.118