Title :
The Geolocation of Web Logs from Textual Clues
Author :
Fink, Clayton ; Piatko, Christine ; Mayfield, James ; Chou, Danielle ; Finin, Tim ; Martineau, Justin
Author_Institution :
Appl. Phys. Lab., Johns Hopkins Univ., Laurel, MD, USA
Abstract :
Understanding the spatial distribution of people who author social media content is of growing interest for researchers and commerce. Blogging platforms depend on authors reporting their own location. However, not all authors report or reveal their location on their blog´s home page. Automated geolocation strategies using IP address and domain name are not adequate for determining an author´s location because most blogs are not self-hosted. In this paper we describe a method that uses the place name mentions in a blog to determine an author´s location. We achieved an accuracy of 63% on a collection of 844 blogs with known locations.
Keywords :
Internet; Web sites; social networking (online); text analysis; transport protocols; IP address; Web logs; blog home page; domain name; geolocation; social media content; spatial distribution; textual clues; Blogs; Business; Computer science; Distributed computing; Information services; Internet; Life estimation; Physics computing; Testing; Web sites; disambiguation; geolocation; named entity recognition; social media;
Conference_Titel :
Computational Science and Engineering, 2009. CSE '09. International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4244-5334-4
Electronic_ISBN :
978-0-7695-3823-5
DOI :
10.1109/CSE.2009.584