DocumentCode :
153571
Title :
Cloak and Swagger: Understanding Data Sensitivity through the Lens of User Anonymity
Author :
Peddinti, Sai Teja ; Korolova, Aleksandra ; Bursztein, Elie ; Sampemane, Geetanjali
Author_Institution :
Polytech. Sch. of Eng., New York Univ., New York, NY, USA
fYear :
2014
fDate :
18-21 May 2014
Firstpage :
493
Lastpage :
508
Abstract :
Most of what we understand about data sensitivity is through user self-report (e.g., surveys); this paper is the first to use behavioral data to determine content sensitivity, via the clues that users give as to what information they consider private or sensitive through their use of privacy-enhancing product features. We perform a large-scale analysis of user anonymity choices during their activity on Quora, a popular question-and-answer site. We identify categories of questions for which users are more likely to exercise anonymity and explore several machine learning approaches towards predicting whether a particular answer will be written anonymously. Our findings validate the viability of the proposed approach towards an automatic assessment of data sensitivity, show that data sensitivity is a nuanced measure that should be viewed on a continuum rather than as a binary concept, and advance the idea that machine learning over behavioral data can be effectively used to develop product features that can help keep users safe.
Keywords :
data privacy; learning (artificial intelligence); Quora; automatic assessment; behavioral data; cloak; content sensitivity; data sensitivity; machine learning; privacy enhancing product features; question-and-answer site; swagger; user anonymity; user self-report; Context; Crawlers; Data privacy; Facebook; Privacy; Search engines; Sensitivity;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Security and Privacy (SP), 2014 IEEE Symposium on
Conference_Location :
San Jose, CA
ISSN :
1081-6011
Type :
conf
DOI :
10.1109/SP.2014.38
Filename :
6956583