Title :
A quantifying method for trade-off between privacy and utility
Author :
Gu Yonghao ; Wu Weiming
Author_Institution :
Sch. of Comput. Sci., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
Many anonymization methods have been used in data publishing and data mining. In the meantime, they reduce the utility of the dataset. So it is important to consider the tradeoff between privacy and utility. Quantifying the trade-off between usefulness and privacy of dataset has been the subject of much research in recent years. In this paper, we provide the concepts of privacy loss and utility loss and also give a method to quantify them using divergence distance in probability theory. And then, we evaluate our methodology on the Adult dataset from the UCI machine learning repository. Our result shows the relationship between privacy and utility, and also provide data users how to choose the right trade-off between privacy and utility. Finally, we conclude and show the future research direction on how to select best divergence measurement.
Keywords :
data mining; data privacy; learning (artificial intelligence); publishing; security of data; UCI machine learning repository; anonymization methods; data mining; data publishing; divergence distance; divergence measurement; privacy loss; probability theory; quantifying method; utility loss; utility reduction; Divergence; Entropy; Privacy Loss; Trade-off; Utility Loss;
Conference_Titel :
Information and Communications Technologies (IETICT 2013), IET International Conference on
Conference_Location :
Beijing
Electronic_ISBN :
978-1-84919-653-6
DOI :
10.1049/cp.2013.0062