Managing DRAM Latency Divergence in Irregular GPGPU Applications

Author

Chatterjee, Niladrish ; O´Connor, Mike ; Loh, Gabriel H. ; Jayasena, Nuwan ; Balasubramonia, Rajeev

fYear

2014

fDate

16-21 Nov. 2014

Firstpage

128

Lastpage

139

Abstract

Memory controllers in modern GPUs aggressively reorder requests for high bandwidth usage, often interleaving requests from different warps. This leads to high variance in the latency of different requests issued by the threads of a warp. Since a warp in a SIMT architecture can proceed only when all of its memory requests are returned by memory, such latency divergence causes significant slowdown when running irregular GPGPU applications. To solve this issue, we propose memory scheduling mechanisms that avoid inter-warp interference in the DRAM system to reduce the average memory stall latency experienced by warps. We further reduce latency divergence through mechanisms that coordinate scheduling decisions across multiple independent memory channels. Finally we show that carefully orchestrating the memory scheduling policy can achieve low average latency for warps, without compromising bandwidth utilization. Our combined scheme yields a 10.1% performance improvement for irregular GPGPU workloads relative to a throughput-optimized GPU memory controller.

Keywords

DRAM chips; graphics processing units; storage management chips; DRAM system; SIMT architecture; average memory stall latency; bandwidth utilization; high bandwidth usage; independent memory channels; interleaving requests; interwarp interference; irregular GPGPU applications; latency divergence; memory controllers; memory requests; memory scheduling mechanisms; memory scheduling policy; scheduling decisions; throughput-optimized GPU memory controller; Bandwidth; Graphics processing units; Instruction sets; Memory management; Parallel processing; Random access memory;

fLanguage

English

Publisher

ieee

Conference_Titel

High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for

Conference_Location

New Orleans, LA

Print_ISBN

978-1-4799-5499-5

Type

conf

DOI

10.1109/SC.2014.16

Filename

7012998