DocumentCode :
2885700
Title :
Using Online Classified Ads to Identify the Geographic Footprints of Anonymous, Casual Sex-Seeking Individuals
Author :
Fries, J.A. ; Segre, A.M. ; Polgreen, P.M.
Author_Institution :
Dept. of Comput. Sci., Univ. of Iowa, Iowa City, IA, USA
fYear :
2012
fDate :
3-5 Sept. 2012
Firstpage :
402
Lastpage :
410
Abstract :
This paper describes a method of using Craigslist personal ads to better understand the movement behavior of anonymous, casual sex-seeking individuals within the men-who-have-sex-with-men community. Given recent dramatic increases in HIV and sexually transmitted disease within this community, gaining insight into how sexual networks connect neighborhoods and cities is important for formulating public health interventions. Because subsets of Craigslist ads exhibit a high degree of similarity, and because a set of near-identical ads, when not spam, is presumed to originate from the same author, we can apply techniques for efficient near-duplicate detection to identify clusters of near-identical ads. By examining each of these clusters and identifying differences in user-supplied location tags, we can then reconstruct an approximation of an anonymous individual's movement footprint over time, as well as estimate the rate at which ad authors seek sexual encounters. For the state of California, we find that 86% of all encounter requests for a given set occur within a 50-mile area, with less than 4% of messages reflecting long-distance travel over 250 miles. 60% of all detected clusters reposted ads within 2 weeks of the first detected post. We show that even in the relatively noisy, unstructured data environment of anonymous personal ads, it is still possible to extract meaningful signal and identify useful social network properties for analysis.
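The clustering idea in the abstract can be illustrated with a minimal sketch. The abstract does not name the specific near-duplicate technique, so this example assumes word-level shingling with Jaccard similarity and a brute-force pairwise comparison, which is not the efficient method the paper refers to. The `(ad_id, location_tag, text)` schema, the function names, and the `threshold` and `k` values are all hypothetical.

```python
from itertools import combinations


def shingles(text, k=3):
    """Return the set of k-word shingles in a lowercased ad body."""
    words = text.lower().split()
    if len(words) < k:
        return {tuple(words)}
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two shingle sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_ads(ads, threshold=0.6, k=3):
    """Group ads whose shingle sets overlap at or above the threshold.

    ads: list of (ad_id, location_tag, text) tuples (assumed schema).
    Returns a sorted list of clusters, each a sorted list of ad_ids.
    """
    parent = {ad_id: ad_id for ad_id, _, _ in ads}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    sets = {ad_id: shingles(text, k) for ad_id, _, text in ads}
    # Brute-force O(n^2) comparison; fine for a sketch, not for a full crawl.
    for a, b in combinations(sets, 2):
        if jaccard(sets[a], sets[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for ad_id in parent:
        groups.setdefault(find(ad_id), []).append(ad_id)
    return sorted(sorted(g) for g in groups.values())


def footprint(ads, cluster):
    """Distinct location tags seen within one cluster, sorted."""
    tags = {ad_id: tag for ad_id, tag, _ in ads}
    return sorted({tags[ad_id] for ad_id in cluster})
```

Calling `cluster_ads` on a list of ads and then `footprint` on each resulting cluster yields the set of locations associated with each presumed author. A real pipeline would likely replace the pairwise loop with a hashing scheme such as MinHash and would need a separate spam filter, as the abstract notes.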
Keywords :
diseases; health care; social networking (online); California; Craigslist personal ads; HIV; anonymous casual sex-seeking individuals; geographic footprints; men-who-have-sex-with-men community; online classified ads; public health; sexual networks; sexually transmitted disease; social network; user-supplied location tags; Cities and towns; Communities; Educational institutions; Entropy; Feature extraction; Human immunodeficiency virus; Public healthcare; Craigslist; geography; near-duplicate detection; networks; sexual behavior; sexually transmitted diseases;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Privacy, Security, Risk and Trust (PASSAT), 2012 International Conference on and 2012 International Conference on Social Computing (SocialCom)
Conference_Location :
Amsterdam
Print_ISBN :
978-1-4673-5638-1
Type :
conf
DOI :
10.1109/SocialCom-PASSAT.2012.86
Filename :
6406381