DocumentCode :
1806725
Title :
Optimal communication structures for big data aggregation
Author :
Culhane, William ; Kogan, Kirill ; Jayalath, Chamikara ; Eugster, Patrick
fYear :
2015
fDate :
April 26 2015-May 1 2015
Firstpage :
1643
Lastpage :
1651
Abstract :
Aggregation of computed sets of results fundamentally underlies the distillation of information in many of today´s big data applications. To this end there are many systems which have been introduced which allow users to obtain aggregate results by aggregating along communication structures such as trees, but they do not focus on optimizing performance by optimizing the underlying structure to perform the aggregation. We consider two cases of the problem - aggregation of (1) single blocks of data, and of (2) streaming input. For each case we determine which metric of “fast” completion is the most relevant and mathematically model resulting systems based on aggregation trees to optimize that metric. Our assumptions and model are laid out in depth. From our model we determine how to create a provably ideal aggregation tree (i.e., with optimal fanin) using only limited information about the aggregation function being applied. Experiments in the Amazon Elastic Compute Cloud (EC2) confirm the validatity of our models in practice.
Keywords :
Big Data; data handling; Amazon Elastic Compute Cloud; Big Data aggregation; Big Data applications; EC2; aggregation trees; communication structures; information distillation; Aggregates; Bandwidth; Big data; Computational modeling; Computers; Conferences; Mathematical model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Communications (INFOCOM), 2015 IEEE Conference on
Conference_Location :
Kowloon
Type :
conf
DOI :
10.1109/INFOCOM.2015.7218544
Filename :
7218544
Link To Document :
بازگشت