DocumentCode
1369031
Title
Privacy Preserving Decision Tree Learning Using Unrealized Data Sets
Author
Fong, Pui K. ; Weber-Jahnke, Jens H.
Author_Institution
Dept. of Comput. Sci., Univ. of Victoria, New Westminster, BC, Canada
Volume
24
Issue
2
fYear
2012
Firstpage
353
Lastpage
364
Abstract
Privacy preservation is important for machine learning and data mining, but measures designed to protect private information often result in a trade-off: reduced utility of the training samples. This paper introduces a privacy preserving approach that can be applied to decision tree learning, without concomitant loss of accuracy. It describes an approach to the preservation of the privacy of collected data samples in cases where information from the sample database has been partially lost. This approach converts the original sample data sets into a group of unreal data sets, from which the original samples cannot be reconstructed without the entire group of unreal data sets. Meanwhile, an accurate decision tree can be built directly from those unreal data sets. This novel approach can be applied directly to the data storage as soon as the first sample is collected. The approach is compatible with other privacy preserving approaches, such as cryptography, for extra protection.
Keywords
cryptography; data mining; data privacy; decision trees; learning (artificial intelligence); cryptography; data mining; data storage; machine learning; privacy preservation; privacy preserving decision tree learning; Classification; Cryptography; Data mining; Data privacy; Decision trees; Information security; Machine learning; Privacy; Classification; data mining; machine learning; security and privacy protection.;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2010.226
Filename
5620916
Link To Document