DocumentCode :
2883933
Title :
Efficient Graph Models for Retrieving Top-k News Feeds from Ego Networks
Author :
Pickhardt, R. ; Gottron, T. ; Scherp, Ansgar ; Staab, S. ; Kunze, J.
Author_Institution :
Inst. for Web Sci. & Technol., Univ. of Koblenz-Landau, Koblenz, Germany
fYear :
2012
fDate :
3-5 Sept. 2012
Firstpage :
123
Lastpage :
133
Abstract :
A key challenge of web platforms like social networking sites and services for news feed aggregation is the efficient and targeted distribution of new content items to users. This can be formulated as the problem of retrieving the top-k news items out of the d-degree ego network of each given user, where the set of all users producing feeds is of size n, with n ≫ d ≫ k and typically k <; 20. Existing approaches employ either expensive join operations on global indices or suffer from high redundancy through denormalization. This makes retrieval of different top-k news feeds for thousands of users per second very inefficient in a large social network. In this paper, we propose two graph models GRAPHITY and STOU to address this problem. GRAPHITY is optimized for fast retrieval of news feeds and has a runtime of O(k log(k)). The GRAPHITY index does not involve data redundancy. An update of the index upon insertion of a new item to the feed is possible in a runtime linear to the nodes´ indegree din. New content can be stored in STOU in O(1) at the cost of slower retrieval speed of O(d log(d)). We verify the theoretical runtime complexity of GRAPHITY and STOU on two data sets of different characteristics and size. We show that on a single machine GRAPHITY is able to retrieve more than 10 000 unique news feeds per second in a network with more than one million users. Our evaluation confirms that retrieval of news feeds with GRAPHITY is independent of the node degree d of a user´s ego network and network size n and does scale to networks of arbitrary size.
Keywords :
graph theory; information retrieval; social networking (online); GRAPHITY graph models; STOU graph models; d-degree ego networks; denormalization; global indices; graph models; join operations; news feed aggregation; social networking sites; top-k news feed retrieval; Complexity theory; Data models; Feeds; Indexes; Runtime; Social network services; Sorting; graph data base; graph index; graphity; scalability; social network; social news feed; top k join;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Privacy, Security, Risk and Trust (PASSAT), 2012 International Conference on and 2012 International Confernece on Social Computing (SocialCom)
Conference_Location :
Amsterdam
Print_ISBN :
978-1-4673-5638-1
Type :
conf
DOI :
10.1109/SocialCom-PASSAT.2012.73
Filename :
6406277
Link To Document :
بازگشت