DocumentCode :
1733287
Title :
The Geolocation of Web Logs from Textual Clues
Author :
Fink, Clayton ; Piatko, Christine ; Mayfield, James ; Chou, Danielle ; Finin, Tim ; Martineau, Justin
Author_Institution :
Appl. Phys. Lab., Johns Hopkins Univ., Laurel, MD, USA
Volume :
4
fYear :
2009
Firstpage :
1088
Lastpage :
1092
Abstract :
Understanding the spatial distribution of people who author social media content is of growing interest for researchers and commerce. Blogging platforms depend on authors reporting their own location. However, not all authors report or reveal their location on their blog's home page. Automated geolocation strategies using IP address and domain name are not adequate for determining an author's location because most blogs are not self-hosted. In this paper we describe a method that uses the place name mentions in a blog to determine an author's location. We achieved an accuracy of 63% on a collection of 844 blogs with known locations.
Keywords :
Internet; Web sites; social networking (online); text analysis; transport protocols; IP address; Web logs; blog home page; domain name; geolocation; social media content; spatial distribution; textual clues; Blogs; Business; Computer science; Distributed computing; Information services; Internet; Life estimation; Physics computing; Testing; Web sites; disambiguation; geolocation; named entity recognition; social media;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computational Science and Engineering, 2009. CSE '09. International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4244-5334-4
Electronic_ISBN :
978-0-7695-3823-5
Type :
conf
DOI :
10.1109/CSE.2009.584
Filename :
5282996