• DocumentCode
    3125959
  • Title

    On the Hardness of Graph Anonymization

  • Author

    Aggarwal, Charu C. ; Li, Yao ; Yu, Philip S.

  • Author_Institution
    IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2011
  • fDate
    11-14 Dec. 2011
  • Firstpage
    1002
  • Lastpage
    1007
  • Abstract
    In this paper, we examine the problem of node re-identification from anonymized graphs. Typical graphs encountered in real applications are massive and sparse. In this paper, we will show that massive and sparse graphs have certain theoretical properties which make them susceptible to re-identification attacks. We design a systematic way to exploit these theoretical properties in order to construct re- identification signatures, which are also known as characteristic vectors. These signatures have the property that they are extremely robust to perturbations, especially for massive and sparse graphs. Our results show that even low levels of anonymization require perturbation levels which are significant enough to result in a massive loss of utility. Our experimental results also show that the true anonymization level of graphs is much lower than is implied by measures such as k-anonymity. Thus, the results of this paper establish that the problem of massive graph anonymization has fundamental theoretical barriers which prevent a fully effective solution.
  • Keywords
    data analysis; data mining; digital signatures; graph theory; social networking (online); characteristic vectors; data analysis; data mining; graph anonymization; massive graphs; node re-identification; re-identification attacks; re-identification signatures; social network; sparse graphs; Approximation methods; Couplings; Privacy; Random variables; Robustness; Social network services; Vectors; anonymization; graphs; privacy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver,BC
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4577-2075-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2011.112
  • Filename
    6137305