• DocumentCode
    13975
  • Title

    On Identity Disclosure Control for Hypergraph-Based Data Publishing

  • Author

    Yidong Li ; Hong Shen

  • Author_Institution
    Sch. of Comput. & Inf. Technol., Beijing Jiaotong Univ., Beijing, China
  • Volume
    8
  • Issue
    8
  • fYear
    2013
  • fDate
    Aug. 2013
  • Firstpage
    1384
  • Lastpage
    1396
  • Abstract
    Data publishing based on hypergraphs is becoming increasingly popular due to its power in representing multirelations among objects. However, security issues have been little studied on this subject, while most recent work only focuses on the protection of relational data or graphs. As a major privacy breach, identity disclosure reveals the identification of entities with certain background knowledge known by an adversary. In this paper, we first introduce a novel background knowledge attack model based on the property of hyperedge ranks, and formalize the rank-based hypergraph anonymization problem. We then propose a complete solution in a two-step framework: rank anonymization and hypergraph reconstruction. We also take hypergraph clustering (known as community detection) as data utility into consideration, and discuss two metrics to quantify information loss incurred in the perturbation. Our approaches are effective in terms of efficacy, privacy, and utility. The algorithms run in near-quadratic time on hypergraph size, and protect data from rank attacks with almost the same utility preserved. The performances of the methods have been validated by extensive experiments on real-world datasets as well. Our rank-based attack model and algorithms for rank anonymization and hypergraph reconstruction are, to our best knowledge, the first systematic study to privacy preserving for hypergraph-based data publishing.
  • Keywords
    graph theory; graphs; security of data; adversary; background knowledge attack model; community detection; data protection; data utility; graphs; hyperedge ranks; hypergraph based data publishing; hypergraph clustering; hypergraph reconstruction; hypergraph size; identity disclosure control; information loss quantification; near quadratic time; perturbation; privacy breach; rank anonymization; rank based attack model; rank based hypergraph anonymization problem; relational data; Clustering algorithms; Communities; Data privacy; Measurement; Privacy; Publishing; Social network services; Anonymization; community detection; hypergraph clustering; identity disclosure; private data publishing;
  • fLanguage
    English
  • Journal_Title
    Information Forensics and Security, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1556-6013
  • Type

    jour

  • DOI
    10.1109/TIFS.2013.2271425
  • Filename
    6548066