DocumentCode
610309
Title
Enumerating subgraph instances using map-reduce
Author
Afrati, F.N. ; Fotakis, D. ; Ullman, J.D.
Author_Institution
Nat. Tech. Univ. of Athens, Athens, Greece
fYear
2013
fDate
8-12 April 2013
Firstpage
62
Lastpage
73
Abstract
The theme of this paper is how to find all instances of a given “sample” graph in a larger “data graph,” using a single round of map-reduce. For the simplest sample graph, the triangle, we improve upon the best known such algorithm. We then examine the general case, considering both the communication cost between mappers and reducers and the total computation cost at the reducers. To minimize communication cost, we exploit the techniques of [1] for computing multiway joins (evaluating conjunctive queries) in a single map-reduce round. Several methods are shown for translating sample graphs into a union of conjunctive queries with as few queries as possible. We also address the matter of optimizing computation cost. Many serial algorithms are shown to be “convertible,” in the sense that it is possible to partition the data graph, explore each partition in a separate reducer, and have the total computation cost at the reducers be of the same order as the computation cost of the serial algorithm.
Keywords
graph theory; parallel algorithms; query processing; Map-Reduce; communication cost; computation cost; conjunctive queries; data graph; multiway joins; subgraph instance enumeration; Abstracts; Approximation algorithms; Complexity theory; Educational institutions; Partitioning algorithms; Probabilistic logic; Silicon;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering (ICDE), 2013 IEEE 29th International Conference on
Conference_Location
Brisbane, QLD
ISSN
1063-6382
Print_ISBN
978-1-4673-4909-3
Electronic_ISBN
1063-6382
Type
conf
DOI
10.1109/ICDE.2013.6544814
Filename
6544814
Link To Document