Title :
Synchronous Parallel Processing of Big-Data Analytics Services to Optimize Performance in Federated Clouds
Author :
Jung, Gueyoung ; Gnanasambandam, Nathan ; Mukherjee, Tridib
Author_Institution :
Xerox Res. Center Webster, Webster, MA, USA
Abstract :
Parallelization of big-data analytics services over a federation of heterogeneous clouds has been considered to improve performance. However, contrary to common intuition, there is an inherent tradeoff between the level of parallelism and the performance for big-data analytics principally because of a significant delay for big-data to get transferred over the network. The data transfer delay can be comparable or even higher than the time required to compute data. To address the aforementioned tradeoff, this paper determines: (a) how many and which computing nodes in federated clouds should be used for parallel execution of big-data analytics; (b) opportunistic apportioning of big-data to these computing nodes in a way to enable synchronized completion at best-effort performance; and (c) sequence of apportioned, different sizes of big-data chunks to be computed in each node so that transfer of a chunk is overlapped as much as possible with the computation of the previous chunk in the node. In this regard, Maximally Overlapped Bin-packing driven Bursting (MOBB) algorithm is proposed, which improve the performance by up to 60% against existing approaches.
Keywords :
bin packing; cloud computing; data analysis; software performance evaluation; MOBB algorithm; big-data analytics services; big-data chunks; computing nodes; data transfer delay; federated clouds; heterogeneous clouds; maximally overlapped bin-packing driven bursting algorithm; opportunistic apportioning; parallel execution; performance improvement; performance optimization; synchronous parallel processing; Computational modeling; Data mining; Delay; Estimation; Parallel processing; Sorting; Synchronization; big-data analytics; federated clouds; parallelization;
Conference_Titel :
Cloud Computing (CLOUD), 2012 IEEE 5th International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
978-1-4673-2892-0
DOI :
10.1109/CLOUD.2012.108