• DocumentCode
    704074
  • Title

    Approximate associative memristive memory for energy-efficient GPUs

  • Author

    Rahimi, Abbas ; Ghofrani, Amirali ; Kwang-Ting Cheng ; Benini, Luca ; Gupta, Rajesh K.

  • Author_Institution
    CSE, UC San Diego, La Jolla, CA, USA
  • fYear
    2015
  • fDate
    9-13 March 2015
  • Firstpage
    1497
  • Lastpage
    1502
  • Abstract
    Multimedia applications running on thousands of deep and wide pipelines working concurrently in GPUs have been an important target for power minimization both at the architectural and algorithmic levels. At the hardware level, energy-efficiency techniques that employ voltage overscaling face a barrier so-called “path walls”: reducing operating voltage beyond a certain point generates massive number of timing errors that are impractical to tolerate. We propose an architectural innovation, called A2M2 module (approximate associative memristive memory) that exhibits few tolerable timing errors suitable for GPU applications under voltage overscaling. A2M2 is integrated with every floating point unit (FPU), and performs partial functionality of the associated FPU by pre-storing high frequency patterns for computational reuse that avoids overhead due to re-execution. Voltage overscaled A2M2 is designed to match an input search pattern with any of the stored patterns within a Hamming distance range of 0-2. This matching behavior under voltage overscaling leads to a controllable approximate computing for multimedia applications. Our experimental results for the AMD Southern Islands GPU show that four image processing kernels tolerate the mismatches during pattern matching resulting in a PSNR ≥ 30dB. The A2M2 module with 8-row enables 28% voltage overscaling in 45nm technology resulting in 32% average energy saving for the kernels, while delivering an acceptable quality of service.
  • Keywords
    content-addressable storage; energy conservation; graphics processing units; image matching; multimedia systems; power aware computing; quality of service; A2M2 module; AMD Southern Islands GPU; FPU; Hamming distance; algorithmic level; approximate associative memristive memory; architectural innovation; architectural level; computational reuse; energy-efficient GPU; floating point unit; high frequency patterns; image processing kernels; multimedia applications; path walls; pattern matching; power minimization; quality of service; timing errors; voltage overscaling; Approximation methods; Graphics processing units; Image processing; Kernel; Memristors; PSNR; Pattern matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015
  • Conference_Location
    Grenoble
  • Print_ISBN
    978-3-9815-3704-8
  • Type

    conf

  • Filename
    7092626