Title :
Efficient Parallel Community Detection in Large Edge-Intensive Networks
Author :
Guangliang Gao ; Zhan Bu ; Zhiang Wu ; Yuan Li ; Jie Cao
Author_Institution :
Coll. of Comput. Sci. & Eng., Nanjing Univ. of Sci. & Technol., Nanjing, China
Abstract :
Community detection is a classic and very difficult task in social network analysis. A large number of methods have been developed for both efficient and effective community detection. However, many of the existing methods depend heavily on the number of links in the network, and thus they often suffer from computational inefficiency when applied to large edge-intensive networks. In this paper, we present a novel SIMPLifying and Ensembling (SIMPLE) framework for parallel community detection. It employs random link sampling to simplify the network and obtains a basic partitioning on each sampled graph. Then, K-means-based Consensus Clustering is used to ensemble the basic partitionings into high-quality community structures. Meanwhile, the random sampling and sampled graph partitioning steps are encapsulated into MapReduce to further improve efficiency. Experiments on four real-world social networks analyze the key parameters and factors inside SIMPLE, and demonstrate its effectiveness.
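The pipeline outlined in the abstract (sample links, partition each sampled graph, then combine the partitionings) can be illustrated with a minimal Python sketch. This is not the authors' implementation. Several parts are assumptions made only for illustration: the base partitioner is a connected-components stand-in (the abstract does not name one), the consensus step is approximated by plain k-means on concatenated one-hot encodings of the basic partitionings, and all function names and parameters (`sample_links`, `components`, `one_hot_rows`, `kmeans`, `simple_sketch`, `rate`, `rounds`) are hypothetical. The sampling rounds are independent of each other, which is what would make them suitable for a map phase, but the sketch runs them sequentially.

```python
import random

def sample_links(edges, rate, rng):
    """Random link sampling: keep each link independently with probability `rate`."""
    return [e for e in edges if rng.random() < rate]

def components(n, edges):
    """Stand-in base partitioner (assumption): label nodes by connected component."""
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for u, v in edges:
        parent[find(u)] = find(v)
    return [find(i) for i in range(n)]

def one_hot_rows(partitionings):
    """Build one row per node by concatenating one-hot codes of every partitioning."""
    n = len(partitionings[0])
    rows = [[] for _ in range(n)]
    for labels in partitionings:
        ids = sorted(set(labels))
        for i, lab in enumerate(labels):
            rows[i].extend(1.0 if lab == c else 0.0 for c in ids)
    return rows

def kmeans(rows, k, rng, iters=50):
    """Plain Lloyd-style k-means with squared Euclidean distance."""
    centers = [list(r) for r in rng.sample(rows, k)]
    assign = None
    for _ in range(iters):
        new = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(r, centers[c])))
            for r in rows
        ]
        if new == assign:
            break  # assignments stopped changing
        assign = new
        for c in range(k):
            members = [r for r, a in zip(rows, assign) if a == c]
            if members:  # an empty cluster keeps its previous center
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return assign

def simple_sketch(n, edges, k, rate=0.7, rounds=10, seed=0):
    """Sample, partition each sample, then ensemble the basic partitionings."""
    rng = random.Random(seed)
    # Each round is independent; here they simply run one after another.
    basics = [components(n, sample_links(edges, rate, rng)) for _ in range(rounds)]
    return kmeans(one_hot_rows(basics), k, rng)

if __name__ == "__main__":
    # Toy graph: two triangles joined by a single bridge link (2, 3).
    toy_edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
    print(simple_sketch(6, toy_edges, k=2))
```

The result depends on the random seed and on the k-means initialization, so the toy run is only meant to show the data flow. A real base partitioner and the paper's consensus utility function would replace the stand-ins.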
Keywords :
complex networks; graph theory; information analysis; network theory (graphs); parallel programming; pattern clustering; sampling methods; social networking (online); MapReduce; SIMPLE framework; k-means-based consensus clustering; large edge-intensive networks; parallel community detection; random sampling; sampled graph partitioning; simplifying and ensembling framework; social network analysis; Approximation algorithms; Clustering algorithms; Communities; Image edge detection; Partitioning algorithms; Twitter; Social Network; Community Detection; Random;
Conference_Title :
Advanced Cloud and Big Data (CBD), 2014 Second International Conference on
Print_ISBN :
978-1-4799-8086-4
DOI :
10.1109/CBD.2014.52