Title :
A time-efficient connected densest subgraph discovery algorithm for big data
Author :
Wu, Bo ; Shen, Haiying
Author_Institution :
Department of Electrical and Computer Engineering, Clemson University, South Carolina 29634, USA
Abstract :
In this paper, we propose a time-efficient and exact algorithm for the problem of discovering the densest subgraph in big data. Current algorithms for solving this problem have three problems: i) they cannot handle the dilemma between the efficiency of handing big data and the precision of the discovered densest subgraph; ii) they cannot take advantage of both the parallel computing on MapReduce and in-memory computing on one computer; iii) their applicability to different kinds of graphs has not been discussed. Our proposed algorithm combines the MapReduce parallel computing with in-memory computing on one computer together to improve the efficiency and precision of discovering the densest subgraphs. The algorithm consists of two computational phases: i) the graph reduction in the MapReduce framework; ii) the densest subgraph discovery in memory. Further, we theoretically analyze the correctness of this algorithm and its applicability in different natural graphs. We conduct extensive experimental evaluations in a MapReduce framework on both massive real-world graphs and simulated graphs to test our algorithm in comparison with other algorithms. Experimental results show that our algorithm is more time-efficient and precise than other algorithms.
Keywords :
Algorithm design and analysis; Approximation algorithms; Big data; Complex networks; Computers; Parallel processing; Topology;
Conference_Titel :
Networking, Architecture and Storage (NAS), 2015 IEEE International Conference on
Conference_Location :
Boston, MA, USA
DOI :
10.1109/NAS.2015.7255197