DocumentCode :
668115
Title :
Distance-aware virtual cluster performance optimization: A Hadoop case study
Author :
Xinkui Zhao ; Jianwei Yin ; Zuoning Chen ; Xingjian Lu
Author_Institution :
Coll. of Comput. Sci., Zhejiang Univ., Hangzhou, China
fYear :
2013
fDate :
23-27 Sept. 2013
Firstpage :
1
Lastpage :
8
Abstract :
Cloud computing and big data are two important trends in information technology. However, data-intensive computing faces challenges when running on virtual machines in the cloud because of competition for virtualized resources and complex network communication. The network becomes one of the most notorious bottlenecks, which motivates strategies to lower communication and transmission costs in a virtual cluster. In this paper, we present a novel cluster performance optimization strategy named vClusterOpt. vClusterOpt finds centralized subgraphs of the node graph and chooses the node with the shortest logical distance as the kernel node of each subgraph, in order to reduce inter-machine communication and transmission costs in the virtual cluster. To calculate logical distance accurately, we define two kinds of logical distance: Logical Communication Distance (LCD) and Logical Transmission Distance (LTD). The VM with the shortest LCD to the others is used as the communication kernel node, which bears the most information communication stress, while the VM with the shortest LTD is treated as the transmission kernel node, which bears the most data transmission stress. We choose benchmarks running on Hadoop as representative of data-intensive computing services to demonstrate the effectiveness of our approach. Experiments show that our distance-aware virtual cluster optimization strategy achieves an average performance improvement of 20%.
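The kernel-node choice described in the abstract can be illustrated with a minimal sketch. The abstract does not define how LCD or LTD is computed, so this example assumes a precomputed table of pairwise logical distances and assumes "shortest distance to the others" means the smallest summed distance; the function name `pick_kernel_node` and the VM names and values are hypothetical and are not taken from the paper.

```python
from typing import Dict, Hashable, Iterable

# Pairwise logical distances, e.g. distance["vm1"]["vm2"].
# How LCD/LTD values are measured is NOT given in the abstract;
# the table is assumed to be computed elsewhere.
Distances = Dict[Hashable, Dict[Hashable, float]]


def pick_kernel_node(nodes: Iterable[Hashable], distance: Distances) -> Hashable:
    """Return the node with the smallest summed distance to all other nodes.

    Assumption: "shortest logical distance" is read as the minimum total
    distance to the other members of the subgraph.
    """
    members = list(nodes)
    if not members:
        raise ValueError("subgraph has no nodes")

    def total_distance(node: Hashable) -> float:
        return sum(distance[node][other] for other in members if other != node)

    return min(members, key=total_distance)


# Hypothetical LCD values for a three-VM subgraph (illustrative only).
lcd: Distances = {
    "vm1": {"vm2": 1.0, "vm3": 4.0},
    "vm2": {"vm1": 1.0, "vm3": 2.0},
    "vm3": {"vm1": 4.0, "vm2": 2.0},
}

communication_kernel = pick_kernel_node(["vm1", "vm2", "vm3"], lcd)
```

With a separate LTD table, the same function would select the transmission kernel node; the two kernels need not be the same VM.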
Keywords :
cloud computing; data handling; graph theory; optimisation; pattern clustering; virtual machines; Hadoop case study; LCD; LTD; VM; big data; centralized subgraphs; cloud computing; cluster performance optimization strategy; complex network communication; data transmission stress; data-intensive computing; distance-aware virtual cluster performance optimization; information technology area; intermachine communication; kernel node; logical communication distance; logical transmission distance; node graph; shortest logical distance; transmission cost; transmission kernel node; vClusterOpt; virtual machines; virtualized resource competition; Cloud computing; Clustering algorithms; Kernel; Optimization; Peer-to-peer computing; Servers; Virtual machining; Hadoop; big data; cloud computing; distance-aware virtual cluster; virtual machine communication;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Cluster Computing (CLUSTER), 2013 IEEE International Conference on
Conference_Location :
Indianapolis, IN
Type :
conf
DOI :
10.1109/CLUSTER.2013.6702618
Filename :
6702618