• DocumentCode
    1369031
  • Title

    Privacy Preserving Decision Tree Learning Using Unrealized Data Sets

  • Author

    Fong, Pui K. ; Weber-Jahnke, Jens H.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Victoria, New Westminster, BC, Canada
  • Volume
    24
  • Issue
    2
  • fYear
    2012
  • Firstpage
    353
  • Lastpage
    364
  • Abstract
    Privacy preservation is important for machine learning and data mining, but measures designed to protect private information often result in a trade-off: reduced utility of the training samples. This paper introduces a privacy preserving approach that can be applied to decision tree learning, without concomitant loss of accuracy. It describes an approach to the preservation of the privacy of collected data samples in cases where information from the sample database has been partially lost. This approach converts the original sample data sets into a group of unreal data sets, from which the original samples cannot be reconstructed without the entire group of unreal data sets. Meanwhile, an accurate decision tree can be built directly from those unreal data sets. This novel approach can be applied directly to the data storage as soon as the first sample is collected. The approach is compatible with other privacy preserving approaches, such as cryptography, for extra protection.
  • Keywords
    cryptography; data mining; data privacy; decision trees; learning (artificial intelligence); cryptography; data mining; data storage; machine learning; privacy preservation; privacy preserving decision tree learning; Classification; Cryptography; Data mining; Data privacy; Decision trees; Information security; Machine learning; Privacy; Classification; data mining; machine learning; security and privacy protection.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2010.226
  • Filename
    5620916