Title :
Reducing Amount of Information Loss in k-Anonymization for Secondary Use of Collected Personal Information
Author :
Harada, Kunihiko ; Sato, Yoshinori ; Togashi, Yumiko
Author_Institution :
Yokohama Res. Lab., Hitachi, Yokohama, Japan
Abstract :
Large amounts of information have been collected in recent years, and demand for putting this information to secondary use is growing because it contains a great deal of useful knowledge. Secondary use of personal information, however, always raises privacy concerns. k-anonymization is a technique for releasing personal information in a privacy-protected manner. Classical k-anonymization always requires side information in the form of generalization hierarchies. In addition, the quality of k-anonymized data has long been a central problem in the area because information loss is inherent in anonymization. This paper proposes a new scheme in which generalization hierarchies are constructed automatically from the input information. The scheme not only reduces the operational cost of preparing side information but also improves the quality of the k-anonymization results. Experiments demonstrate that k-anonymization with automatically constructed hierarchies loses 38% less information (measured by information entropy) than k-anonymization with complete binary trees (introduced as the classically used hierarchies).
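The terms used in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: the fixed-width age ranges stand in for a hand-made generalization hierarchy (not an automatically constructed one), the difference in Shannon entropy is only one plausible reading of "information loss measured by information entropy", and all names (`is_k_anonymous`, `entropy`, `generalize_age`) and the sample ages are hypothetical.

```python
from collections import Counter
from math import log2


def is_k_anonymous(records, k):
    """Return True if every distinct quasi-identifier value occurs at least k times."""
    return all(count >= k for count in Counter(records).values())


def entropy(values):
    """Shannon entropy, in bits, of the empirical distribution of values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def generalize_age(age, width):
    """Replace an exact age with a range label of the given width (assumed hierarchy level)."""
    low = (age // width) * width
    return f"{low}-{low + width - 1}"


# Hypothetical quasi-identifier column: eight distinct ages.
ages = [21, 23, 25, 28, 31, 34, 36, 39]

# One generalization step: exact ages become 10-year ranges.
generalized = [generalize_age(a, 10) for a in ages]

# Entropy removed by generalization, used here as an assumed information-loss measure.
loss_bits = entropy(ages) - entropy(generalized)

print(is_k_anonymous(ages, 4))         # raw ages are all unique
print(is_k_anonymous(generalized, 4))  # each range covers four records
print(loss_bits)
```

In this toy data the raw column is not 4-anonymous, while the generalized column is, at the cost of the entropy that separated individual ages. A hierarchy whose ranges follow the actual data distribution can reach the same k with coarser steps only where needed, which is the kind of saving the abstract reports.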
Keywords :
data privacy; collected personal information; information loss; input information; k-anonymization; k-anonymized data; privacy protection; Binary trees; Clustering algorithms; Information entropy; Measurement; Outsourcing; Privacy; personal information; secondary use
Conference_Title :
SRII Global Conference (SRII), 2012 Annual
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4673-2318-5
Electronic_ISBN :
2166-0778
DOI :
10.1109/SRII.2012.18