DocumentCode :
659544
Title :
Memory system characterization of big data workloads
Author :
Dimitrov, Martin ; Kumar, Kush ; Lu, Pingping ; Viswanathan, V. ; Willhalm, Thomas
fYear :
2013
fDate :
6-9 Oct. 2013
Firstpage :
15
Lastpage :
22
Abstract :
Two recent trends that have emerged include (1) Rapid growth in big data technologies with new types of computing models to handle unstructured data, such as map-reduce and noSQL (2) A growing focus on the memory subsystem for performance and power optimizations, particularly with emerging memory technologies offering different characteristics from conventional DRAM (bandwidths, read/write asymmetries). This paper examines how these trends may intersect by characterizing the memory access patterns of various Hadoop and noSQL big data workloads. Using memory DIMM traces collected using special hardware, we analyze the spatial and temporal reference patterns to bring out several insights related to memory and platform usages, such as memory footprints, read-write ratios, bandwidths, latencies, etc. We develop an analysis methodology to understand how conventional optimizations such as caching, prediction, and prefetching may apply to these workloads, and discuss the implications on software and system design.
Keywords :
Big Data; DRAM chips; cache storage; database management systems; DRAM; Hadoop; Map-reduce; bandwidth; big data technology; big data workload; caching; computing model; latency; memory DIMM traces; memory access pattern; memory footprint; memory subsystem; memory system characterization; memory usage; noSQL; performance optimization; platform usage; power optimization; prediction; prefetching; read-write asymmetry; read-write ratio; software design; spatial reference pattern; system design; temporal reference pattern; unstructured data handling; Data handling; Data storage systems; Entropy; Information management; Market research; Measurement; Prefetching; big data; memory characterization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data, 2013 IEEE International Conference on
Conference_Location :
Silicon Valley, CA
Type :
conf
DOI :
10.1109/BigData.2013.6691693
Filename :
6691693
Link To Document :
بازگشت