DocumentCode :
658329
Title :
Exploring Computation Locality of Graph Mining Algorithms on MapReduce
Author :
Qiuhong Li ; Ke Dai ; Wei Wang ; Peng Wang ; Rongming He ; Mingxiu Dong
Author_Institution :
Sch. of Comput. Sci., Fudan Univ., Shanghai, China
Volume :
1
fYear :
2013
fDate :
17-20 Nov. 2013
Firstpage :
45
Lastpage :
50
Abstract :
Previous implementations of graph mining algorithms on MapReduce ignore the characteristic of locality in distributed systems. For distributed systems, locality means the operations take place in local computing nodes without the communication with remote computing nodes. In this paper we present LI-MR (Local Iteration MapReduce) framework to improve a class of graph operators which can be described by repeated matrix-vector multiplications. LI-MR considers locality of sub graphs and adopts coarse granularity of communication unit for MapReduce. In particular, for sub graphs, only partial operations need synchronization. We propose a method to implement random data access on Hadoop by outputting the results to HBase. With the support of range query provided by HBase, LI-MR allows sub graphs to fulfil computation task with enough information in main memory. Because the locality feature of sub graphs, the info for the computation is limited. In this way, LI-MR framework combines in-memory computation with MapReduce model for graph algorithms.
Keywords :
data mining; graph theory; iterative methods; matrix multiplication; parallel algorithms; public domain software; query processing; random processes; vectors; HBase; Hadoop; LI-MR framework; MapReduce model; coarse granularity; communication unit; computation locality; distributed systems; graph mining algorithms; graph operators; in-memory computation; local Iteration MapReduce framework; local computing nodes; random data access; range query; repeated matrix-vector multiplication; subgraph locality feature; synchronization; Aggregates; Algorithm design and analysis; Clustering algorithms; Computational modeling; Optimization; Partitioning algorithms; Synchronization; MapReduce; distributed data mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2013 IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4799-2902-3
Type :
conf
DOI :
10.1109/WI-IAT.2013.7
Filename :
6689992
Link To Document :
بازگشت