DocumentCode
2847156
Title
Top-down specialization for information and privacy preservation
Author
Fung, Benjamin C M ; Wang, Ke ; Yu, Philip S.
Author_Institution
Sch. of Comput. Sci., Simon Fraser Univ., Burnaby, BC, Canada
fYear
2005
fDate
5-8 April 2005
Firstpage
205
Lastpage
216
Abstract
Releasing person-specific data in its most specific state poses a threat to individual privacy. This paper presents a practical and efficient algorithm for determining a generalized version of data that masks sensitive information and remains useful for modelling classification. The generalization of data is implemented by specializing or detailing the level of information in a top-down manner until a minimum privacy requirement is violated. This top-down specialization is natural and efficient for handling both categorical and continuous attributes. Our approach exploits the fact that data usually contains redundant structures for classification. While generalization may eliminate some structures, other structures emerge to help. Our results show that quality of classification can be preserved even for highly restrictive privacy requirements. This work has great applicability to both public and private sectors that share information for mutual benefits and productivity.
Keywords
classification; data privacy; data structures; very large databases; categorical attributes; data generalization; individual privacy protection; information sharing; top-down specialization method; Data analysis; Data mining; Data privacy; Databases; Government; Joining processes; Legislation; Medical diagnostic imaging; Productivity; Protection;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
ISSN
1084-4627
Print_ISBN
0-7695-2285-8
Type
conf
DOI
10.1109/ICDE.2005.143
Filename
1410123
Link To Document