DocumentCode :
3195673
Title :
Using aggregate human genome data for individual identification
Author :
Yue Wang ; Xintao Wu ; Xinghua Shi
Author_Institution :
Coll. of Comput. & Inf., Univ. of North Carolina at Charlotte, Charlotte, NC, USA
fYear :
2013
fDate :
18-21 Dec. 2013
Firstpage :
410
Lastpage :
415
Abstract :
Data privacy in genome-wide association studies (GWAS) is a critical yet under-exploited research area. In this paper, we first provide a method to construct a two-layered bayesian network explicitly revealing the conditional dependency between SNPs and traits, from the public GWAS catalog. Then we develop efficient algorithms for two attacks: identity inference attack and trait inference attack based on reasoning with the dependency relationship captured in the constructed bayesian network. Different from previously proposed attacks, the possible target of our attacks may be any common people, not limited to GWAS participants. The empirical evaluations show that unprotected statistics released from GWAS can be exploited by attackers to identify individual or derive private information. Thus we show that mining GWAS statistics threatens the privacy of a much wider population and privacy protection mechanisms should be employed.
Keywords :
belief networks; bioinformatics; data privacy; genomics; aggregate human genome data; conditional dependency; data privacy; genome wide association studies; individual identification; public GWAS catalog; trait inference attack; two layered Bayesian network; Bayes methods; Bioinformatics; Catalogs; Diseases; Frequency control; Genomics; Bayesian network; Genome-wide association study; privacy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference on
Conference_Location :
Shanghai
Type :
conf
DOI :
10.1109/BIBM.2013.6732527
Filename :
6732527
Link To Document :
بازگشت