Title :
Finding the Big Data Sweet Spot: Towards Automatically Recommending Configurations for Hadoop Clusters on Docker Containers
Author :
Rui Zhang ; Min Li ; Hildebrand, Dean
Abstract :
The complexity of cloud-based analytics environments threatens to undermine their otherwise tremendous values. In particular, configuring such environments presents a great challenge. We propose to alleviate this issue with an engine that recommends configurations for a newly submitted analytics job in an intelligent and timely manner. The engine is rooted in a modified k-nearest neighbor algorithm, which finds desirable configurations from similar past jobs that have performed well. We apply the method to configuring an important class of analytics environments: Hadoop on container-driven clouds. Preliminary evaluation suggests up to 28% performance gain could result from our method.
Keywords :
Big Data; cloud computing; Big Data sweet spot; Hadoop; cloud-based analytics environment complexity; configuration automatic recommendation; container-driven clouds; k-nearest neighbor algorithm; Big data; Containers; Engines; Linux; Performance gain; Resource management; Yarn;
Conference_Titel :
Cloud Engineering (IC2E), 2015 IEEE International Conference on
Conference_Location :
Tempe, AZ
DOI :
10.1109/IC2E.2015.101