Title :
Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling
Author :
Farrahi, Katayoun ; Gatica-Perez, Daniel
Author_Institution :
IDIAP Res. Inst., Ecole Polytech. Fedvrale de Lausanne (EPFL), Lausanne, Switzerland
Abstract :
In this work we address the problem of modeling varying time duration sequences for large-scale human routine discovery from cellphone sensor data using a multi-level approach to probabilistic topic models. We use an unsupervised learning approach that discovers human routines of varying durations ranging from half-hourly to several hours. Our methodology can handle large sequence lengths based on a principled procedure to deal with potentially large routine-vocabulary sizes, and can be applied to rather naive initial vocabularies to discover meaningful location-routines. We successfully apply the model to a large, real-life dataset, consisting of 97 cellphone users and 16 months of their location patterns, to discover routines with varying time durations.
Keywords :
data mining; probability; social sciences computing; unsupervised learning; cellphone sensor data; data mining; human location-routines; multi-level approach; probabilistic topic models; unsupervised learning approach; Data models; Humans; Markov processes; Mobile handsets; Probabilistic logic; Visualization; Vocabulary;
Conference_Titel :
Social Computing (SocialCom), 2010 IEEE Second International Conference on
Conference_Location :
Minneapolis, MN
Print_ISBN :
978-1-4244-8439-3
Electronic_ISBN :
978-0-7695-4211-9
DOI :
10.1109/SocialCom.2010.71