DocumentCode :
3645311
Title :
Privacy Preserving GWAS Data Sharing
Author :
Stephen E. Fienberg;Aleksandra Slavkovic;Caroline Uhler
Author_Institution :
Dept. of Stat., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2011
Firstpage :
628
Lastpage :
635
Abstract :
Traditional statistical methods for the confidentiality protection for statistical databases do not scale well to deal with GWAS (genome-wide association studies) databases and external information on them. The more recent concept of differential privacy, introduced by the cryptographic community, is an approach which provides a rigorous definition of privacy with meaningful privacy guarantees in the presence of arbitrary external information. Building on such notions, we propose new methods to release aggregate GWAS data without compromising an individual´s privacy. We present methods for releasing differentially private minor allele frequencies, chi-square statistics and p-values. We compare these approaches on simulated data and on a GWAS study of canine hair length involving 685 dogs. We also propose a privacy-preserving method for finding genome-wide associations based on a differentially private approach to penalized logistic regression.
Keywords :
"Privacy","Sensitivity","Noise","Hair","Dogs","Genomics","Bioinformatics"
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Print_ISBN :
978-1-4673-0005-6
Type :
conf
DOI :
10.1109/ICDMW.2011.140
Filename :
6137439
Link To Document :
بازگشت