DocumentCode :
243608
Title :
EgoLP: Fast and Distributed Community Detection in Billion-Node Social Networks
Author :
Buzun, Nazar ; Korshunov, Anton ; Avanesov, Valeriy ; Filonenko, Ilya ; Kozlov, Ilya ; Turdakov, Denis ; Hangkyu Kim
Author_Institution :
Inst. for Syst. Program., Fed. Agency of Sci. Organizations, Moscow, Russia
fYear :
2014
fDate :
14-14 Dec. 2014
Firstpage :
533
Lastpage :
540
Abstract :
Community structure is one of the most important and characteristic features of social networks. Numerous methods for discovering implicit user communities from a social graph of users have been proposed in recent years. However, most of them have performance and scalability issues which make them hardly applicable to population-wide analysis of modern social networks (billions of users and growing). In this paper we present EgoLP - an efficient and fully distributed method for social community detection. The method is based on propagating community labels through the network with the help of friendship groups of individual users. Experimental evaluation of Apache Spark implementation of the method showed that it outperforms some state-of-the-art methods in terms of a) similarity of extracted communities to the reference ones from synthetic networks, b) precision of user attributes prediction in Facebook based solely on community memberships, c) likelihood of the discovered community structure according to the proposed generative model. At the same time, the method retains near-linear complexity in the number of edges and is thus applicable to social graphs of up to 109 users.
Keywords :
computational complexity; graph theory; social networking (online); Apache Spark implementation; EgoLP; Facebook; billion-node social networks; community label propagation; community memberships; community structure; distributed community detection; near-linear complexity; social graphs; synthetic networks; user attribute prediction; Accuracy; Communities; Image edge detection; Receivers; Scalability; Social network services; Sparks; Community detection; distributed algorithms; graph clustering; social networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
Type :
conf
DOI :
10.1109/ICDMW.2014.158
Filename :
7022642
Link To Document :
بازگشت