Title :
Probabilistic k-anonymity through microaggregation and data swapping
Author :
Soria-Comas, Jordi ; Domingo-Ferrer, Josep
Author_Institution :
Dept. of Comput. Eng. & Math., Univ. Rovira i Virgili, Tarragona, Spain
Abstract :
k-Anonymity is a privacy property used to limit the risk of re-identification in a microdata set. A data set satisfying k-anonymity consists of groups of k records which are indistinguishable as far as their quasi-identifier attributes are concerned. Hence, the probability of re-identifying a record within a group is 1/k. We introduce the probabilistic k-anonymity property, which relaxes the indistinguishability requirement of k-anonymity and only requires that the probability of re-identification be the same as in k-anonymity. Two computational heuristics to achieve probabilistic k-anonymity based on data swapping are proposed: MDAV microaggregation on the quasi-identifiers plus swapping, and individual ranking microaggregation on individual confidential attributes plus swapping. We report experimental results, where we compare the utility of original, k-anonymous and probabilistically k-anonymous data.
Keywords :
data privacy; probability; computational heuristics; data swapping; individual confidential attribute; individual ranking microaggregation; microdata set; privacy property; probabilistic k-anonymity property; quasi-identifier attribute; record reidentification probability; reidentification risk; Computational intelligence; Couplings; Data privacy; Privacy; Probabilistic logic; Proposals; Remuneration; Computational intelligence; anonymization; clustering; differential privacy; k-anonymity; microaggregation; statistical disclosure control; swapping;
Conference_Titel :
Fuzzy Systems (FUZZ-IEEE), 2012 IEEE International Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1507-4
Electronic_ISBN :
1098-7584
DOI :
10.1109/FUZZ-IEEE.2012.6251280