DocumentCode :
3158702
Title :
@Phillies Tweeting from Philly? Predicting Twitter User Locations with Spatial Word Usage
Author :
Hau-wen Chang ; Dongwon Lee ; Eltaher, Mohammad ; Jeongkyu Lee
Author_Institution :
Pennsylvania State Univ., University Park, PA, USA
fYear :
2012
fDate :
26-29 Aug. 2012
Firstpage :
111
Lastpage :
118
Abstract :
We study the problem of predicting home locations of Twitter users using contents of their tweet messages. Using three probability models for locations, we compare both the Gaussian Mixture Model (GMM) and the Maximum Likelihood Estimation (MLE). In addition, we propose two novel unsupervised methods based on the notions of Non-Localness and Geometric-Localness to prune noisy data from tweet messages. In the experiments, our unsupervised approach improves the baselines significantly and shows comparable results with the supervised state-of-the-art method. For 5,113 Twitter users in the test set, on average, our approach with only 250 selected local words or less is able to predict their home locations (within 100 miles) with the accuracy of 0.499, or has 509.3 miles of average error distance at best.
Keywords :
Gaussian processes; geographic information systems; maximum likelihood estimation; social networking (online); Gaussian mixture model; MLE; Twitter user location; geometric-localness; home location prediction; maximum likelihood estimation; noisy data; nonlocalness; probability model; spatial word usage; tweet message content; unsupervised method; Accuracy; Cities and towns; Maximum likelihood estimation; Predictive models; Twitter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2012 IEEE/ACM International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4673-2497-7
Type :
conf
DOI :
10.1109/ASONAM.2012.29
Filename :
6425775
Link To Document :
بازگشت