DocumentCode :
3128962
Title :
Preventing Identity Disclosure in Hypergraphs
Author :
Li, Yidong ; Shen, Hong
Author_Institution :
Sch. of Comput. & Inf. Technol., Beijing Jiaotong Univ., Beijing, China
fYear :
2011
fDate :
11-11 Dec. 2011
Firstpage :
659
Lastpage :
665
Abstract :
Data publishing based on hyper graphs is becoming increasingly popular due to its power in representing multi-relations among objects. However, security issues have been little studied on this subject, while most recent work only focuses on the protection of relational data or graphs. As a major privacy breach, identity disclosure reveals the identification of entities with certain background knowledge known by an adversary. In this paper, we first introduce a novel background knowledge attack model based on the property of hyper edge ranks, and formalize the rank-based hyper graph anonymization problem. We then propose a complete solution in a two-step framework: rank anonymization and hyper graph construction. We also take hyper graph clustering (known as community detection) as data utility into consideration, and discuss two metrics to quantify information loss incurred in the perturbation. Our approaches are effective in terms of efficacy, privacy and utility. The algorithms run in near-quadratic time on hyper graph size, and protect data from rank attacks with almost same utility preserved. The performances of the methods have been validated by extensive experiments on real-world datasets as well. Our rank-based attack model and algorithms for rank anonymization and hyper graph construction are, to our best knowledge, the first systematic study to privacy preserving for based-based data publishing.
Keywords :
data privacy; graph theory; pattern clustering; security of data; background knowledge attack model; data utility; hyperedge rank property; hypergraph clustering; hypergraph-based data publishing; identity disclosure prevention; near-quadratic time; object multirelation representing; privacy breach; rank-based attack model; rank-based hypergraph anonymization problem; relational data protection; Approximation algorithms; Clustering algorithms; Communities; Facebook; Measurement; Privacy; Anonymization; Community detection; Hypergraph clustering; Identity disclosure; Private data publishing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4673-0005-6
Type :
conf
DOI :
10.1109/ICDMW.2011.139
Filename :
6137443
Link To Document :
بازگشت