DocumentCode
3322399
Title
A Hybrid Approach to Private Record Linkage
Author
Inan, Ali ; Kantarcioglu, Murat ; Bertino, Elisa ; Scannapieco, Monica
Author_Institution
Dept. of Comput. Sci., Univ. of Texas at Dallas, Richardson, TX
fYear
2008
fDate
7-12 April 2008
Firstpage
496
Lastpage
505
Abstract
Real-world entities are not always represented by the same set of features in different data sets. Therefore matching and linking records corresponding to the same real-world entity distributed across these data sets is a challenging task. If the data sets contain private information, the problem becomes even harder due to privacy concerns. Existing solutions of this problem mostly follow two approaches: sanitization techniques and cryptographic techniques. The former achieves privacy by perturbing sensitive data at the expense of degrading matching accuracy. The later, on the other hand, attains both privacy and high accuracy under heavy communication and computation costs. In this paper, we propose a method that combines these two approaches and enables users to trade off between privacy, accuracy and cost. Experiments conducted on real data sets show that our method has significantly lower costs than cryptographic techniques and yields much more accurate matching results compared to sanitization techniques, even when the data sets are perturbed extensively.
Keywords
cryptography; data privacy; cryptographic technique; data anonymization technique; data privacy; distributed real-world entity data set; private record linkage; sanitization technique; Computer science; Costs; Couplings; Cryptographic protocols; Cryptography; Data privacy; Hospitals; Joining processes; Protection; Sliding mode control;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location
Cancun
Print_ISBN
978-1-4244-1836-7
Electronic_ISBN
978-1-4244-1837-4
Type
conf
DOI
10.1109/ICDE.2008.4497458
Filename
4497458
Link To Document