DocumentCode :
1442342
Title :
Publishing Search Logs—A Comparative Study of Privacy Guarantees
Author :
Götz, Michaela ; Machanavajjhala, Ashwin ; Wang, Guozhang ; Xiao, Xiaokui ; Gehrke, Johannes
Author_Institution :
Dept. of Comput. Sci., Cornell Univ., Ithaca, NY, USA
Volume :
24
Issue :
3
fYear :
2012
fDate :
3/1/2012 12:00:00 AM
Firstpage :
520
Lastpage :
532
Abstract :
Search engine companies collect the “database of intentions,” the histories of their users´ search queries. These search logs are a gold mine for researchers. Search engine companies, however, are wary of publishing search logs in order not to disclose sensitive information. In this paper, we analyze algorithms for publishing frequent keywords, queries, and clicks of a search log. We first show how methods that achieve variants of k-anonymity are vulnerable to active attacks. We then demonstrate that the stronger guarantee ensured by ε-differential privacy unfortunately does not provide any utility for this problem. We then propose an algorithm ZEALOUS and show how to set its parameters to achieve (ε, δ)-probabilistic privacy. We also contrast our analysis of ZEALOUS with an analysis by Korolova et al. [17] that achieves (ε´,δ´)-indistinguishability. Our paper concludes with a large experimental study using real applications where we compare ZEALOUS and previous work that achieves k-anonymity in search log publishing. Our results show that ZEALOUS yields comparable utility to k-anonymity while at the same time achieving much stronger privacy guarantees.
Keywords :
data privacy; search engines; ZEALOUS algorithm; k-anonymity; privacy guarantees; probabilistic privacy; search engine; search log click; search log keyword; search log publishing; search log query; user search query; Histograms; History; Indexes; Privacy; Probabilistic logic; Publishing; Search engines; Security; and protection; database management; general; information storage and retrieval; information technology and systems; information technology and systems.; integrity; web search;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2011.26
Filename :
5708146
Link To Document :
بازگشت