• DocumentCode
    659544
  • Title

    Memory system characterization of big data workloads

  • Author

    Dimitrov, Martin ; Kumar, Kush ; Lu, Pingping ; Viswanathan, V. ; Willhalm, Thomas

  • fYear
    2013
  • fDate
    6-9 Oct. 2013
  • Firstpage
    15
  • Lastpage
    22
  • Abstract
    Two recent trends that have emerged include (1) Rapid growth in big data technologies with new types of computing models to handle unstructured data, such as map-reduce and noSQL (2) A growing focus on the memory subsystem for performance and power optimizations, particularly with emerging memory technologies offering different characteristics from conventional DRAM (bandwidths, read/write asymmetries). This paper examines how these trends may intersect by characterizing the memory access patterns of various Hadoop and noSQL big data workloads. Using memory DIMM traces collected using special hardware, we analyze the spatial and temporal reference patterns to bring out several insights related to memory and platform usages, such as memory footprints, read-write ratios, bandwidths, latencies, etc. We develop an analysis methodology to understand how conventional optimizations such as caching, prediction, and prefetching may apply to these workloads, and discuss the implications on software and system design.
  • Keywords
    Big Data; DRAM chips; cache storage; database management systems; DRAM; Hadoop; Map-reduce; bandwidth; big data technology; big data workload; caching; computing model; latency; memory DIMM traces; memory access pattern; memory footprint; memory subsystem; memory system characterization; memory usage; noSQL; performance optimization; platform usage; power optimization; prediction; prefetching; read-write asymmetry; read-write ratio; software design; spatial reference pattern; system design; temporal reference pattern; unstructured data handling; Data handling; Data storage systems; Entropy; Information management; Market research; Measurement; Prefetching; big data; memory characterization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data, 2013 IEEE International Conference on
  • Conference_Location
    Silicon Valley, CA
  • Type

    conf

  • DOI
    10.1109/BigData.2013.6691693
  • Filename
    6691693