DocumentCode
704074
Title
Approximate associative memristive memory for energy-efficient GPUs
Author
Rahimi, Abbas ; Ghofrani, Amirali ; Kwang-Ting Cheng ; Benini, Luca ; Gupta, Rajesh K.
Author_Institution
CSE, UC San Diego, La Jolla, CA, USA
fYear
2015
fDate
9-13 March 2015
Firstpage
1497
Lastpage
1502
Abstract
Multimedia applications running on thousands of deep and wide pipelines working concurrently in GPUs have been an important target for power minimization both at the architectural and algorithmic levels. At the hardware level, energy-efficiency techniques that employ voltage overscaling face a barrier so-called “path walls”: reducing operating voltage beyond a certain point generates massive number of timing errors that are impractical to tolerate. We propose an architectural innovation, called A2M2 module (approximate associative memristive memory) that exhibits few tolerable timing errors suitable for GPU applications under voltage overscaling. A2M2 is integrated with every floating point unit (FPU), and performs partial functionality of the associated FPU by pre-storing high frequency patterns for computational reuse that avoids overhead due to re-execution. Voltage overscaled A2M2 is designed to match an input search pattern with any of the stored patterns within a Hamming distance range of 0-2. This matching behavior under voltage overscaling leads to a controllable approximate computing for multimedia applications. Our experimental results for the AMD Southern Islands GPU show that four image processing kernels tolerate the mismatches during pattern matching resulting in a PSNR ≥ 30dB. The A2M2 module with 8-row enables 28% voltage overscaling in 45nm technology resulting in 32% average energy saving for the kernels, while delivering an acceptable quality of service.
Keywords
content-addressable storage; energy conservation; graphics processing units; image matching; multimedia systems; power aware computing; quality of service; A2M2 module; AMD Southern Islands GPU; FPU; Hamming distance; algorithmic level; approximate associative memristive memory; architectural innovation; architectural level; computational reuse; energy-efficient GPU; floating point unit; high frequency patterns; image processing kernels; multimedia applications; path walls; pattern matching; power minimization; quality of service; timing errors; voltage overscaling; Approximation methods; Graphics processing units; Image processing; Kernel; Memristors; PSNR; Pattern matching;
fLanguage
English
Publisher
ieee
Conference_Titel
Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015
Conference_Location
Grenoble
Print_ISBN
978-3-9815-3704-8
Type
conf
Filename
7092626
Link To Document