DocumentCode :
1483074
Title :
Probabilistic Mining of Socio-Geographic Routines From Mobile Phone Data
Author :
Farrahi, Katayoun ; Gatica-Perez, Daniel
Author_Institution :
Idiap Res. Inst., Martigny, Switzerland
Volume :
4
Issue :
4
fYear :
2010
Firstpage :
746
Lastpage :
755
Abstract :
There is relatively little work on the investigation of large-scale human data in terms of multimodality for human activity discovery. In this paper, we suggest that human interaction data, or human proximity, obtained by mobile phone Bluetooth sensor data, can be integrated with human location data, obtained by mobile cell tower connections, to mine meaningful details about human activities from large and noisy datasets. We propose a model, called bag of multimodal behavior, that integrates the modeling of variations of location over multiple time-scales, and the modeling of interaction types from proximity. Our representation is simple yet robust to characterize real-life human behavior sensed from mobile phones, which are devices capable of capturing large-scale data known to be noisy and incomplete. We use an unsupervised approach, based on probabilistic topic models, to discover latent human activities in terms of the joint interaction and location behaviors of 97 individuals over the course of approximately a 10-month period using data from MIT´s Reality Mining project. Some of the human activities discovered with our multimodal data representation include “going out from 7 pm-midnight alone” and “working from 11 am-5 pm with 3-5 other people,” further finding that this activity dominantly occurs on specific days of the week. Our methodology also finds dominant work patterns occurring on other days of the week. We further demonstrate the feasibility of the topic modeling framework for human routine discovery by predicting missing multimodal phone data at specific times of the day.
Keywords :
data mining; probability; social sciences computing; unsupervised learning; bluetooth sensor; human activity discovery; mobile phone data; multimodal data representation; probabilistic mining; socio-geographic routines; Reality Mining; human activity; human mobility; topic models;
fLanguage :
English
Journal_Title :
Selected Topics in Signal Processing, IEEE Journal of
Publisher :
ieee
ISSN :
1932-4553
Type :
jour
DOI :
10.1109/JSTSP.2010.2049513
Filename :
5457969
Link To Document :
بازگشت