Title :
Dynamic front-end sharing in graphics processing units
Author :
Tao Zhang ; Xiaoyao Liang
Author_Institution :
Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China
Abstract :
A modern GPU processor consumes several times power of a multi-core CPU and delivers a much higher processing throughput. Researchers propose various architectural innovations to improve its energy efficiency. We observe that different streaming processors (SMs) in a GPU tend to exhibit very similar behavior for many GPU workloads. If multiple SMs can be grouped together and work in synchronous manner, it is possible to save energy by sharing the front-end in the SM pipeline including the instruction fetch, decode and schedule units. For efficient flow control and program correctness, the proposed architecture can identify unfavorable conditions and ungroup the SMs when necessary. However, sharing pipeline front-end between multiple SMs brings architectural challenges. In this paper, we show our design, implementation and evaluation for such an architecture. Detailed experiment results manifest 33.7% front-end and 6.8% total GPU energy reduction can be achieved.
Keywords :
energy conservation; graphics processing units; multiprocessing systems; GPU energy reduction; GPU processor; dynamic front-end sharing; energy efficiency; graphics processing unit; multicore CPU; streaming processor; Graphics processing units; Kernel; Multiprocessor interconnection; Pipelines; Runtime; Wires;
Conference_Titel :
Computer Design (ICCD), 2014 32nd IEEE International Conference on
Conference_Location :
Seoul
DOI :
10.1109/ICCD.2014.6974695