DocumentCode
659400
Title
Communication efficient algorithms for fundamental big data problems
Author
Sanders, P. ; Schlag, Sebastian ; Muller, Ian
Author_Institution
Inst. of Theor. Inf., Karlsruhe Inst. of Technol., Karlsruhe, Germany
fYear
2013
fDate
6-9 Oct. 2013
Firstpage
15
Lastpage
23
Abstract
Big Data applications often store or obtain their data distributed over many computers connected by a network. Since the network is usually slower than the local memory of the machines, it is crucial to process the data in such a way that not too much communication takes place. Indeed, only communication volume sublinear in the input size may be affordable. We believe that this direction of research deserves more intensive study. We give examples for several fundamental algorithmic problems where nontrivial algorithms with sublinear communication volume are possible. Our main technical contribution are several related results on distributed Bloom filter replacements, duplicate detection, and data base join. As an example of a very different family of techniques, we discuss linear programming in low dimensions.
Keywords
Big Data; data structures; linear programming; Big Data applications; Big Data problems; communication efficient algorithms; computer network; data distribution; data processing; database; distributed Bloom filter replacements; duplicate detection; linear programming; nontrivial algorithms; sublinear communication volume; Arrays; Computational modeling; Data handling; Data storage systems; Distributed databases; Information management; algorithm; communication volume; data base; duplicate detection; join; linear programming;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data, 2013 IEEE International Conference on
Conference_Location
Silicon Valley, CA
Type
conf
DOI
10.1109/BigData.2013.6691549
Filename
6691549
Link To Document