DocumentCode :
625049
Title :
Exploiting Foursquare and Cellular Data to Infer User Activity in Urban Environments
Author :
Noulas, Anastasios ; Mascolo, Cecilia
Author_Institution :
Comput. Lab., Univ. of Cambridge, Cambridge, UK
Volume :
1
fYear :
2013
fDate :
3-6 June 2013
Firstpage :
167
Lastpage :
176
Abstract :
Inferring the type of activities in neighborhoods of urban centers may be helpful in a number of contexts including urban planning, content delivery and activity recommendations for mobile web users or may even yield to a deeper understanding of the geographical evolution of social life in the city . During the past few years, the analysis of mobile phone usage patterns, or of social media with longitudinal attributes, have aided the automatic characterization of the dynamics of the urban environment. In this work, we combine a dataset sourced from a telecommunication provider in Spain with a database of millions of geotagged venues from Foursquare and we formulate the problem of urban activity inference in a supervised learning framework. In particular, we exploit user communication patterns observed at the base station level in order to predict the activity of Foursquare users who checkin-in at nearby venues. First, we mine a set of machine learning features that allow us to encode the input telecommunication signal of a tower. Subsequently, we evaluate a diverse set of supervised learning algorithms using labels extracted from Foursquare place categories and we consider two application scenarios. Initially, we assess how hard it is to predict specific urban activity of an area, showing that Nightlife and Entertainment spots are those easier to infer, whereas College and Shopping areas are those featuring the lowest accuracy rates. Then, considering a candidate set of activity types in a geographic area, we aim to elect the most prominent one. We demonstrate how the difficulty of the problem increases with the number of classes incorporated in the prediction task, yet the classifiers achieve a considerably better performance compared to a random guess even when the set of candidate classes increases.
Keywords :
cellular radio; learning (artificial intelligence); mobile computing; Foursquare; cellular data; content delivery; geographical evolution; geotagged venue; longitudinal attribute; machine learning; mobile phone usage pattern; social media; supervised learning; urban activity inference; urban environment; urban planning; user activity; user communication pattern; Cities and towns; Entertainment industry; Entropy; Mobile communication; Poles and towers; Supervised learning; cellular data; location-based services; urban mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mobile Data Management (MDM), 2013 IEEE 14th International Conference on
Conference_Location :
Milan
Print_ISBN :
978-1-4673-6068-5
Type :
conf
DOI :
10.1109/MDM.2013.27
Filename :
6569133
Link To Document :
بازگشت