Title :
Performance and Energy Efficient Hardware-Based Scheduler for Symmetric/Asymmetric CMPs
Author :
Nikola Markovic;Daniel Nemirovsky;Osman S. Unsal;Marteo Valero;Adrian Cristal
Author_Institution :
Barcelona Supercomput. Center, Barcelona, Spain
Abstract :
As thread level parallelism in applications has continued to expand, so has research in chip multi-core processors. Since more and more applications become multi-threaded we expect to find a growing number of threads executing on a machine. Consequently, the operating system will require increasingly larger amounts of CPU time to schedule these threads efficiently. Instead of perpetuating the trend of performing more complex thread scheduling in the operating system, we propose a hardware implementation of the Thread Lock Section-aware Scheduling (TLSS) scheduling mechanism. This lightweight mechanism helps to identify multi-threaded application bottlenecks such as thread synchronization sections and complements the Fairness-aware Scheduler method. It is, to our knowledge, the first hardware based lock section-aware scheduling that is energy attentive and can be applied to both asymmetric and symmetric CMPs. It achieves an average performance gains of 10.9 percent (geometric mean) compared to the state-of-the-art Linux OS Scheduler when applied on the Symmetrical Chip Multi-Processor (SCMP). At the same time, it is 81 percent more EDP (energy-delay product) efficient when applied on an Asymmetrical Chip Multi-Processor (ACMP) and compared to the Linux OS Scheduler on an SCMP, where ACMP and SCMP take relatively the same chip area.
Keywords :
"Instruction sets","Hardware","Multicore processing","Operating systems","Job shop scheduling","Runtime"
Conference_Titel :
Computer Architecture and High Performance Computing (SBAC-PAD), 2015 27th International Symposium on
DOI :
10.1109/SBAC-PAD.2015.14