DocumentCode
3125959
Title
On the Hardness of Graph Anonymization
Author
Aggarwal, Charu C. ; Li, Yao ; Yu, Philip S.
Author_Institution
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
fYear
2011
fDate
11-14 Dec. 2011
Firstpage
1002
Lastpage
1007
Abstract
In this paper, we examine the problem of node re-identification from anonymized graphs. Typical graphs encountered in real applications are massive and sparse. In this paper, we will show that massive and sparse graphs have certain theoretical properties which make them susceptible to re-identification attacks. We design a systematic way to exploit these theoretical properties in order to construct re- identification signatures, which are also known as characteristic vectors. These signatures have the property that they are extremely robust to perturbations, especially for massive and sparse graphs. Our results show that even low levels of anonymization require perturbation levels which are significant enough to result in a massive loss of utility. Our experimental results also show that the true anonymization level of graphs is much lower than is implied by measures such as k-anonymity. Thus, the results of this paper establish that the problem of massive graph anonymization has fundamental theoretical barriers which prevent a fully effective solution.
Keywords
data analysis; data mining; digital signatures; graph theory; social networking (online); characteristic vectors; data analysis; data mining; graph anonymization; massive graphs; node re-identification; re-identification attacks; re-identification signatures; social network; sparse graphs; Approximation methods; Couplings; Privacy; Random variables; Robustness; Social network services; Vectors; anonymization; graphs; privacy;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location
Vancouver,BC
ISSN
1550-4786
Print_ISBN
978-1-4577-2075-8
Type
conf
DOI
10.1109/ICDM.2011.112
Filename
6137305
Link To Document