DocumentCode :
3450970
Title :
Partitioning Social Networks for Fast Retrieval of Time-Dependent Queries
Author :
Yuan, Mindi ; Stein, David ; Carrasco, Berenice ; Trindade, Joana M F ; Lu, Yi
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear :
2012
fDate :
1-5 April 2012
Firstpage :
205
Lastpage :
212
Abstract :
Online social network (OSN) queries require retrievals of multiple small records generated by different users in the network, and the set of records to be retrieved is time dependent. Current implementation of hash-based partitioning results in accesses at a large number of servers, which significantly degrades response time. Partitioning the OSN friendship graph is difficult as its power-law degree distribution leads to many cross-partition edges. Naive replication requires extra storage that is orders of magnitude larger. In our previous work (2011), we proposed to partition not only the spatial network of social relations, but also in the time dimension so that users who have communicated in a given period are grouped together. We built an activity prediction graph (APG) to keep in one partition newly created data that are highly likely to be accessed together. In this paper, we analyze the distribution of the Facebook wall posts in the New Orleans network. We further emphasize that the objective of partitioning is to keep the two-hop neighborhood of a user in one partition, instead of the one-hop network usually considered. Two-hop neighborhoods are the basic units of retrieval in OSN and can be much larger than one-hop networks. We use a static partitioning method based on KMETIS, and a dynamic local partitioning method that maintains evenness and requires only a small amount of data movement across partitions. For evaluation, the partitioning results are tested with emulation of Facebook page downloads. We show that partitioning on two-hop networks yields at lest 19% more local queries than its one-hop counterpart. The static algorithm achieves 5.6 times better data locality than hash-based partitioning and the dynamic algorithm achieves 6.4 times better locality while keeping the number of movements small. Almost all queries are kept in at most 3 partitions for both algorithms.
Keywords :
graph theory; query processing; social networking (online); APG; Facebook page downloads; Facebook wall post distribution; KMETIS; New Orleans network; OSN friendship graph; OSN query; activity prediction graph; cross-partition edges; dynamic algorithm; dynamic local partitioning method; hash-based partitioning; naive replication; one-hop network; online social network partitioning; power-law degree distribution; record retrieval; social relation spatial network; static algorithm; static partitioning method; time dimension; time-dependent query retrieval; user two-hop neighborhood; Correlation; Facebook; Heuristic algorithms; Partitioning algorithms; Prediction algorithms; Receivers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering Workshops (ICDEW), 2012 IEEE 28th International Conference on
Conference_Location :
Arlington, VA
Print_ISBN :
978-1-4673-1640-8
Type :
conf
DOI :
10.1109/ICDEW.2012.63
Filename :
6313681
Link To Document :
بازگشت