Title :
Modeling and Integrating Background Knowledge in Data Anonymization
Author :
Li, Tiancheng ; Li, Ninghui ; Zhang, Jian
Author_Institution :
Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN
fDate :
March 29 2009-April 2 2009
Abstract :
Recent work has shown the importance of considering the adversary´s background knowledge when reasoning about privacy in data publishing. However, it is very difficult for the data publisher to know exactly the adversary´s background knowledge. Existing work cannot satisfactorily model background knowledge and reason about privacy in the presence of such knowledge. This paper presents a general framework for modeling the adversary´s background knowledge using kernel estimation methods. This framework subsumes different types of knowledge (e.g., negative association rules) that can be mined from the data. Under this framework, we reason about privacy using Bayesian inference techniques and propose the skyline (B, t)-privacy model, which allows the data publisher to enforce privacy requirements to protect the data against adversaries with different levels of background knowledge. Through an extensive set of experiments, we show the effects of probabilistic background knowledge in data anonymization and the effectiveness of our approach in both privacy protection and utility preservation.
Keywords :
data privacy; estimation theory; inference mechanisms; publishing; adversary background knowledge; data anonymization; data publishing; kernel estimation methods; privacy protection; skyline (B, t)-privacy model; using Bayesian inference techniques; Association rules; Cancer; Data engineering; Data privacy; Diseases; Hospitals; Influenza; Lungs; Protection; Statistics; Anonymity; Data Privacy; Data Security; Kernel Estimation;
Conference_Titel :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
DOI :
10.1109/ICDE.2009.86