• DocumentCode
    166195
  • Title

    NDC: Analyzing the impact of 3D-stacked memory+logic devices on MapReduce workloads

  • Author

    Pugsley, Seth H. ; Jestes, Jeffrey ; Huihui Zhang ; Balasubramonian, R. ; Srinivasan, V. ; Buyuktosunoglu, Alper ; Davis, A.K. ; Feifei Li

  • Author_Institution
    Univ. of Utah, Salt Lake City, UT, USA
  • fYear
    2014
  • fDate
    23-25 March 2014
  • Firstpage
    190
  • Lastpage
    200
  • Abstract
    While Processing-in-Memory has been investigated for decades, it has not been embraced commercially. A number of emerging technologies have renewed interest in this topic. In particular, the emergence of 3D stacking and the imminent release of Micron´s Hybrid Memory Cube device have made it more practical to move computation near memory. However, the literature is missing a detailed analysis of a killer application that can leverage a Near Data Computing (NDC) architecture. This paper focuses on in-memory MapReduce workloads that are commercially important and are especially suitable for NDC because of their embarrassing parallelism and largely localized memory accesses. The NDC architecture incorporates several simple processing cores on a separate, non-memory die in a 3D-stacked memory package; these cores can perform Map operations with efficient memory access and without hitting the bandwidth wall. This paper describes and evaluates a number of key elements necessary in realizing efficient NDC operation: (i) low-EPI cores, (ii) long daisy chains of memory devices, (iii) the dynamic activation of cores and SerDes links. Compared to a baseline that is heavily optimized for MapReduce execution, the NDC design yields up to 15X reduction in execution time and 18X reduction in system energy.
  • Keywords
    computer architecture; logic devices; storage management chips; 3D-stacked memory+logic devices; MapReduce workloads; Micron hybrid memory cube device; NDC architecture; SerDes links; long daisy chains; low-EPI cores; near data computing architecture; processing-in-memory; Bandwidth; Central Processing Unit; Databases; Memory management; Random access memory; Three-dimensional displays;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Performance Analysis of Systems and Software (ISPASS), 2014 IEEE International Symposium on
  • Conference_Location
    Monterey, CA
  • Print_ISBN
    978-1-4799-3604-5
  • Type

    conf

  • DOI
    10.1109/ISPASS.2014.6844483
  • Filename
    6844483