DocumentCode
33759
Title
A Random Decision Tree Framework for Privacy-Preserving Data Mining
Author
Vaidya, Jaideep ; Shafiq, Basit ; Wei Fan ; Mehmood, Danish ; Lorenzi, David
Author_Institution
MSIS Dept., Rutgers Univ., Newark, NJ, USA
Volume
11
Issue
5
fYear
2014
fDate
Sept.-Oct. 2014
Firstpage
399
Lastpage
411
Abstract
Distributed data is ubiquitous in modern information driven applications. With multiple sources of data, the natural challenge is to determine how to collaborate effectively across proprietary organizational boundaries while maximizing the utility of collected information. Since using only local data gives suboptimal utility, techniques for privacy-preserving collaborative knowledge discovery must be developed. Existing cryptography-based work for privacy-preserving data mining is still too slow to be effective for large scale data sets to face today´s big data challenge. Previous work on random decision trees (RDT) shows that it is possible to generate equivalent and accurate models with much smaller cost. We exploit the fact that RDTs can naturally fit into a parallel and fully distributed architecture, and develop protocols to implement privacy-preserving RDTs that enable general and efficient distributed privacy-preserving knowledge discovery.
Keywords
data mining; data privacy; decision trees; big data challenge; cryptography-based work; distributed data; distributed privacy-preserving knowledge discovery; organizational boundaries; privacy-preserving RDT; privacy-preserving collaborative knowledge discovery; privacy-preserving data mining; random decision tree framework; suboptimal utility; Data mining; Decision trees; Encryption; Protocols; Vectors; Vegetation; Privacy-preserving data mining; classification;
fLanguage
English
Journal_Title
Dependable and Secure Computing, IEEE Transactions on
Publisher
ieee
ISSN
1545-5971
Type
jour
DOI
10.1109/TDSC.2013.43
Filename
6616536
Link To Document