DocumentCode :
3525067
Title :
Do You Feel the Lag of Your Hadoop?
Author :
Yuxuan Jiang ; Zhe Huang ; Tsang, Danny H. K.
Author_Institution :
Dept. of Electron. & Comput. Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
fYear :
2015
fDate :
March 30 2015-April 2 2015
Firstpage :
115
Lastpage :
119
Abstract :
The configuration of a Hadoop cluster is significantly important to its performance, because an improper configuration can greatly deteriorate the job execution performance. Unfortunately, systematic guidelines on how to configure a Hadoop cluster are still missing. In this paper, we undertake an empirical study on key operations and mechanisms of Hadoop job execution, including the task assignment strategy and speculative execution. Based on the experiments, we provide suggestions on the system configuration, particularly on the matching between the hardware resource partitioning scheme and the job splitting granularity.
Keywords :
data handling; parallel processing; pattern clustering; resource allocation; Hadoop cluster configuration; Hadoop job execution; hardware resource partitioning scheme; job splitting granularity; task assignment strategy; Cloud computing; Delays; Electronic publishing; Hardware; Indexes; Resource management; Time-domain analysis; Empirical Study; Hadoop; Performance Measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data Computing Service and Applications (BigDataService), 2015 IEEE First International Conference on
Conference_Location :
Redwood City, CA
Type :
conf
DOI :
10.1109/BigDataService.2015.14
Filename :
7184871
Link To Document :
بازگشت