DocumentCode :
186383
Title :
Deep-dive analysis of the data analytics workload in CloudSuite
Author :
Yasin, Ahmad ; Ben-Asher, Yosi ; Mendelson, Avi
Author_Institution :
Intel Corp., Hillsboro, OR, USA
fYear :
2014
fDate :
26-28 Oct. 2014
Firstpage :
202
Lastpage :
211
Abstract :
Exponential growth of digital data has introduced massively-parallel systems, special orchestration layers, and new scale-out applications. While recent works suggest characteristics of scale-out workloads are different from those of traditional ones, their root causes are not understood. Such understanding is extremely important to improve efficiency; even a 1% performance gain for a core can have a large impact on the datacenter as a whole. This paper studies the characteristics of a Big Data Analytics (BDA) workload on a modern cloud server. It is intentionally focused on a single workload-platform in order to enable deep-dive analysis that aims to understand the root causes of the CPU bottlenecks which this paper identify. We choose the Data Analytics benchmark from CloudSuite [1] as a representative of a growing family of important applications. This paper describes a customization of a comprehensive threefold analysis method. The method consists of a System level, where sensitivity to system parameters is examined, as well as Application and Architectural levels; where bottlenecks are attributed back to the application and runtime codes, respectively. The paper also adopts a proof-by-optimization approach to prove bottlenecks´ validity. Overall, 65% net speedup is measured with significant power reduction. The paper reveals that BDA workloads suffer from overheads related to managing the data rather than accessing the data. For example, Hash index lookup is found to be a key performance limiter. Inefficiencies leading to such unexpected behavior are demonstrated, including JVM selection and heavily unoptimized application code, both of which have a big impact. Suboptimal microarchitecture areas are demonstrated too, in addition to programming styles that limit exploitation of upcoming JVM and CPU parallelization features.
Keywords :
Big Data; cloud computing; computer centres; data analysis; table lookup; BDA workload; CPU bottleneck; CPU parallelization feature; CloudSuite; Hash index lookup; JVM selection; big data analytics workload; cloud server; comprehensive threefold analysis method; datacenter; deep-dive analysis; digital data; key performance limiter; massively-parallel system; power reduction; programming styles; proof-by-optimization approach; runtime code; scale-out workload; single workload-platform; special orchestration layer; suboptimal microarchitecture area; system parameter; unoptimized application code; Computer architecture; Data analysis; Instruction sets; Measurement; Optimization; Runtime;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Workload Characterization (IISWC), 2014 IEEE International Symposium on
Conference_Location :
Raleigh, NC
Print_ISBN :
978-1-4799-6452-9
Type :
conf
DOI :
10.1109/IISWC.2014.6983059
Filename :
6983059
Link To Document :
بازگشت