DocumentCode :
146231
Title :
Design and implementation of novel source synchronous interconnection in modern GPU chips
Author :
Tao Li ; Sadowski, Greg
Author_Institution :
GPU Group, Adv. Macro Device (AMD), Shanghai, China
fYear :
2014
fDate :
2-5 Sept. 2014
Firstpage :
130
Lastpage :
135
Abstract :
As the architecture of GPU chips evolves to provide higher performance with lower power, new topology of graphics shader engines interconnection to local frame buffers becomes critical. Source synchronous interconnection has been widely adopted in Network-On-Chip (NoC). The SSB bus fabric to transfer data between shader engines and frame buffers adopts more of the globally asynchronous locally synchronous (GALS) design style for a large size GPU chip, in order to deal with the challenge of delivering synchronous high frequencies clocks in the GHz range across full chip. It also reduces the area cost and power consumption on long distance wide width data transfer. In this paper, we present the design structure and physical implementation of a novel source synchronous interconnect network for GALS-style GPU topology. This combines the source synchronous bus lane together with Multiple Data Rate (MDR) structure and much higher transmission clock frequency than shader clock to provide high bandwidth, high speed, low area cost data transmission fabric for GPU chips. We also developed MDR signal bits encoding techniques to reduce the toggle rate of the MDR data nets. With clock gating scheme and MDR signal encoding techniques adapted to the applications, we could further reduce the total power on the SSB transmission fabric.
Keywords :
graphics processing units; integrated circuit interconnections; network-on-chip; GALS design; GALS-style GPU topology; GPU chips; MDR signal bits encoding techniques; MDR signal encoding techniques; MDR structure; NoC; SSB bus fabric; SSB transmission fabric; clock gating scheme; data transmission fabric; globally asynchronous locally synchronous design; graphics shader engines interconnection; local frame buffers; long distance wide width data transfer; multiple data rate structure; network-on-chip; source synchronous interconnection; toggle rate; transmission clock frequency; Amplitude modulation; Clocks; Encoding; Energy dissipation; Graphics processing units; Power demand; Repeaters; AOCV; GALS; GPU; MDR; NoC; OCV; SSB; ToF;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System-on-Chip Conference (SOCC), 2014 27th IEEE International
Conference_Location :
Las Vegas, NV
Type :
conf
DOI :
10.1109/SOCC.2014.6948913
Filename :
6948913
Link To Document :
بازگشت