• DocumentCode
    735896
  • Title

    Disclosure risk assessment via record linkage by a maximum-knowledge attacker

  • Author

    Domingo-Ferrer, Josep ; Ricci, Sara ; Soria-Comas, Jordi

  • Author_Institution
    Dept. of Comput. Eng. & Math., Univ. Rovira i Virgili, Tarragona, Spain
  • fYear
    2015
  • fDate
    21-23 July 2015
  • Firstpage
    28
  • Lastpage
    35
  • Abstract
    Before releasing an anonymized data set, the data protector must know how safe the data set is, that is, how much disclosure risk is incurred by the release. If no privacy model is used to select specific privacy guarantees prior to anonymization, posterior disclosure risk assessment must be performed based on the anonymized data set and, if the result is not satisfactory, anonymization must be repeated with stricter privacy parameters. Even if a privacy model is used, it may still be advisable to empirically evaluate disclosure on the anonymized data set, especially if the privacy model parameters have been relaxed to improve data utility. Record linkage is a general methodology to posterior disclosure risk assessment, whereby the data protector attempts to recreate the attacker´s re-identification scenario. An important limitation of record linkage is that it usually requires the data protector to make restrictive assumptions on the attacker´s background knowledge. To overcome this limitation, we present a maximum-knowledge attacker model and then we specify and compare several record linkage tests for such a worst-case attacker. Our tests are based on comparing the distribution of linkage distances between the original and the anonymized data set with the distribution of distances between one of the two previous data sets and one random data set. The more similar the distributions, the more plausibly deniable are record linkages claimed by an attacker. Because attaining zero disclosure risk for all records is too costly in terms of utility, a less demanding alternative is presented whose goal is to reduce the maximum per-record disclosure risk.
  • Keywords
    data protection; optimisation; risk management; security of data; data protector; disclosure risk assessment; maximum-knowledge attacker; privacy model; record linkage; Couplings; Data models; Data privacy; Dictionaries; Noise; Privacy; Risk management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Privacy, Security and Trust (PST), 2015 13th Annual Conference on
  • Conference_Location
    Izmir
  • Type

    conf

  • DOI
    10.1109/PST.2015.7232951
  • Filename
    7232951