Dynamic front-end sharing in graphics processing units

Author

Tao Zhang ; Xiaoyao Liang

Author_Institution

Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai, China

fYear

2014

fDate

19-22 Oct. 2014

Firstpage

286

Lastpage

291

Abstract

A modern GPU processor consumes several times power of a multi-core CPU and delivers a much higher processing throughput. Researchers propose various architectural innovations to improve its energy efficiency. We observe that different streaming processors (SMs) in a GPU tend to exhibit very similar behavior for many GPU workloads. If multiple SMs can be grouped together and work in synchronous manner, it is possible to save energy by sharing the front-end in the SM pipeline including the instruction fetch, decode and schedule units. For efficient flow control and program correctness, the proposed architecture can identify unfavorable conditions and ungroup the SMs when necessary. However, sharing pipeline front-end between multiple SMs brings architectural challenges. In this paper, we show our design, implementation and evaluation for such an architecture. Detailed experiment results manifest 33.7% front-end and 6.8% total GPU energy reduction can be achieved.

Keywords

energy conservation; graphics processing units; multiprocessing systems; GPU energy reduction; GPU processor; dynamic front-end sharing; energy efficiency; graphics processing unit; multicore CPU; streaming processor; Graphics processing units; Kernel; Multiprocessor interconnection; Pipelines; Runtime; Wires;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Design (ICCD), 2014 32nd IEEE International Conference on

Conference_Location

Seoul

Type

conf

DOI

10.1109/ICCD.2014.6974695

Filename

6974695