DocumentCode
61701
Title
A Survey of Distributed Data Aggregation Algorithms
Author
Jesus, Paulo ; Baquero, Carlos ; Almeida, Paulo Sergio
Author_Institution
HASLab., Univ. do Minho, Braga, Portugal
Volume
17
Issue
1
fYear
2015
fDate
Firstquarter 2015
Firstpage
381
Lastpage
404
Abstract
Distributed data aggregation is an important task, allowing the decentralized determination of meaningful global properties, which can then be used to direct the execution of other applications. The resulting values are derived by the distributed computation of functions like Count, Sum, and Average. Some application examples deal with the determination of the network size, total storage capacity, average load, majorities and many others. In the last decade, many different approaches have been proposed, with different trade-offs in terms of accuracy, reliability, message and time complexity. Due to the considerable amount and variety of aggregation algorithms, it can be difficult and time consuming to determine which techniques will be more appropriate to use in specific settings, justifying the existence of a survey to aid in this task. This work reviews the state of the art on distributed data aggregation algorithms, providing three main contributions. First, it formally defines the concept of aggregation, characterizing the different types of aggregation functions. Second, it succinctly describes the main aggregation techniques, organizing them in a taxonomy. Finally, it provides some guidelines toward the selection and use of the most relevant techniques, summarizing their principal characteristics.
Keywords
computational complexity; data handling; distributed algorithms; average function; average load; count function; decentralized determination; distributed computation; distributed data aggregation algorithm; message complexity; sum function; time complexity; total storage capacity; Distributed databases; Network topology; Peer-to-peer computing; Routing; Taxonomy; Topology; Wireless sensor networks; Distributed algorithms; data aggregation; fault-tolerance; performance trade-offs;
fLanguage
English
Journal_Title
Communications Surveys & Tutorials, IEEE
Publisher
ieee
ISSN
1553-877X
Type
jour
DOI
10.1109/COMST.2014.2354398
Filename
6894544
Link To Document