DocumentCode
170602
Title
Optimal approximation algorithm of virtual machine placement for data latency minimization in cloud systems
Author
Jian-Jhih Kuo ; Hsiu-Hsien Yang ; Ming-Jer Tsai
Author_Institution
Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
fYear
2014
fDate
April 27 2014-May 2 2014
Firstpage
1303
Lastpage
1311
Abstract
The MapReduce/Hadoop architecture has become very important and effective in cloud systems because many data-intensive applications are usually required to process big data. In such environments, big data is partitioned and stored over several data nodes; thus, the total completion time of a task would be delayed if the maximum access latency among all pairs of a data node and its assigned computation node is not bounded. Moreover, the computation nodes usually need to communicate with each other for aggregating the computation results; therefore, the maximum access latency among all pairs of assigned computation nodes also needs to be bounded. In the literature, it has been proved that the placement problem of computation nodes (virtual machines) to minimize the maximum access latency among all pairs of a data node and its assigned computation node and among all pairs of assigned computation nodes does not admit any approximation algorithm with a factor smaller than two, whereas no approximation algorithms have been proposed so far. In this paper, we first propose a 3-approximation algorithm for resolving the problem. Subsequently, we close the gap by proposing a 2-approximation algorithm, that is, an optimal approximation algorithm, for resolving the problem in the price of higher time complexity. Finally, we conduct simulations for evaluating the performance of our algorithms.
Keywords
Big Data; approximation theory; cloud computing; computational complexity; minimisation; virtual machines; 2-approximation algorithm; 3-approximation algorithm; Big Data processing; MapReduce-Hadoop architecture; cloud systems; computation node placement problem; data access latency; data latency minimization; data node; data-intensive applications; maximum access latency minimization; optimal approximation algorithm; time complexity; virtual machine placement; Approximation algorithms; Approximation methods; Big data; Bipartite graph; Conferences; Three-dimensional displays; Virtual machining;
fLanguage
English
Publisher
ieee
Conference_Titel
INFOCOM, 2014 Proceedings IEEE
Conference_Location
Toronto, ON
Type
conf
DOI
10.1109/INFOCOM.2014.6848063
Filename
6848063
Link To Document