Title :
Privacy-Preserving Statistical Analysis by Exact Logistic Regression
Author :
Duverle, David A. ; Kawasaki, Shohei ; Yamada, Yoshiji ; Sakuma, Jun ; Tsuda, Koji
Author_Institution :
Grad. Sch. of Frontier Sci., Univ. of Tokyo, Kashiwa, Japan
Abstract :
Logistic regression is the method of choice in most genome-wide association studies (GWAS). Due to the heavy cost of performing iterative parameter updates when training such a model, existing methods have prohibitive communication and computational complexities that make them unpractical for real-life usage. We propose a new sampling-based secure protocol to compute exact statistics, that requires a constant number of communication rounds and a much lower number of computations. The publicly available implementation of our protocol (and its many optional optimisations adapted to different security scenarios) can, in a matter of hours, perform statistical testing of over 600 SNP variables across thousands of patients while accounting for potential confounding factors in the clinical data.
Keywords :
biology computing; data privacy; genomics; protocols; regression analysis; sampling methods; security of data; GWAS; genome-wide association study; logistic regression; privacy-preserving statistical analysis; sampling-based secure protocol; Bioinformatics; Computational modeling; Encryption; Logistics; Protocols; GWAS; SNP; exact statistics; logistic regression; secure statistical testing;
Conference_Titel :
Security and Privacy Workshops (SPW), 2015 IEEE
Conference_Location :
San Jose, CA
DOI :
10.1109/SPW.2015.14