DocumentCode
724728
Title
One Size Doesn´t Fit All: Measuring Individual Privacy in Aggregate Genomic Data
Author
Simmons, Sean ; Berger, Bonnie
Author_Institution
Dept. of Math., Massachusetts Inst. of Technol., Cambridge, MA, USA
fYear
2015
fDate
21-22 May 2015
Firstpage
41
Lastpage
49
Abstract
Even in the aggregate, genomic data can reveal sensitive information about individuals. We present a new model-based measure, PrivMAF, that provides provable privacy guarantees for aggregate data (namely minor allele frequencies) obtained from genomic studies. Unlike many previous measures that have been designed to measure the total privacy lost by all participants in a study, PrivMAF gives an individual privacy measure for each participant in the study, not just an average measure. These individual measures can then be combined to measure the worst case privacy loss in the study. Our measure also allows us to quantify the privacy gains achieved by perturbing the data, either by adding noise or binning. Our findings demonstrate that both perturbation approaches offer significant privacy gains. Moreover, we see that these privacy gains can be achieved while minimizing perturbation (and thus maximizing the utility) relative to stricter notions of privacy, such as differential privacy. We test PrivMAF using genotype data from the Welcome Trust Case Control Consortium, providing a more nuanced understanding of the privacy risks involved in an actual genome-wide association studies. Interestingly, our analysis demonstrates that the privacy implications of releasing MAFs from a study can differ greatly from individual to individual. An implementation of our method is available at http://privmaf.csail.mit.edu.
Keywords
biology computing; data privacy; genomics; PrivMAF; Welcome Trust Case Control Consortium; aggregate genomic data; genome-wide association studies; genomic studies; genotype data; minor allele frequencies; model-based measure; perturbation approaches; privacy gains; privacy loss; privacy measure; privacy risks; provable privacy guarantees; Atmospheric measurements; Bioinformatics; Data privacy; Genomics; Noise; Particle measurements; Privacy; Aggregate Data; Genomic Privacy; Privacy-Preserving Data Publication; Statistical Disclosure Control;
fLanguage
English
Publisher
ieee
Conference_Titel
Security and Privacy Workshops (SPW), 2015 IEEE
Conference_Location
San Jose, CA
Type
conf
DOI
10.1109/SPW.2015.25
Filename
7163207
Link To Document