DocumentCode :
704250
Title :
Finding the Big Data Sweet Spot: Towards Automatically Recommending Configurations for Hadoop Clusters on Docker Containers
Author :
Rui Zhang ; Min Li ; Hildebrand, Dean
fYear :
2015
fDate :
9-13 March 2015
Firstpage :
365
Lastpage :
368
Abstract :
The complexity of cloud-based analytics environments threatens to undermine their otherwise tremendous values. In particular, configuring such environments presents a great challenge. We propose to alleviate this issue with an engine that recommends configurations for a newly submitted analytics job in an intelligent and timely manner. The engine is rooted in a modified k-nearest neighbor algorithm, which finds desirable configurations from similar past jobs that have performed well. We apply the method to configuring an important class of analytics environments: Hadoop on container-driven clouds. Preliminary evaluation suggests up to 28% performance gain could result from our method.
Keywords :
Big Data; cloud computing; Big Data sweet spot; Hadoop; cloud-based analytics environment complexity; configuration automatic recommendation; container-driven clouds; k-nearest neighbor algorithm; Big data; Containers; Engines; Linux; Performance gain; Resource management; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cloud Engineering (IC2E), 2015 IEEE International Conference on
Conference_Location :
Tempe, AZ
Type :
conf
DOI :
10.1109/IC2E.2015.101
Filename :
7092945
Link To Document :
بازگشت