Title :
Privacy preserving C4.5 using Gini index
Author_Institution :
Dept. of Comput. Sc. & Eng., Centurion Univ., Bhubaneswar, India
Abstract :
Now-a-days privacy has become a major concern; the goals of security like confidentiality, integrity and availability do not ensure privacy. Data mining is a threat to privacy. Researchers today focus on how to ensure privacy while performing data mining task. As Data mining algorithms are typically complex and furthermore the input usually consists of massive data sets, the generic protocols in such a case are of no practical use and therefore more efficient protocols are required. This paper focus on the problem of decision tree learning with the popular C4.5 algorithm. C4.5, an extension of ID3 is a very popular decision tree building method in data mining. Entropy and Gini index are two different criteria used in ID3. While there is quite little work in privacy preserving ID3 using entropy and not much has been done for Gini index. This paper propose modified protocols based on secure multiparty computation for privacy preserving C4.5 using Gini index over distributed partitioned data, where the protocols do not require any third party server. However, some communication overhead is necessary so that the parties can carry out the secure protocols. The result like ROC(Receiver Operating characteristic) graph and detail accuracy through cost counting index is shown.
Keywords :
data mining; data privacy; decision trees; entropy; learning (artificial intelligence); protocols; Gini index; cost counting index; data mining task; decision tree building method; decision tree learning; entropy; privacy preserving C4.5 algorithm; privacy preserving ID3; receiver operating characteristic graph; secure multiparty computation; Data privacy; Decision trees; Indexes; Privacy; Protocols; Data mining; ID3/C4.5; data privacy and security;
Conference_Titel :
Emerging Trends and Applications in Computer Science (NCETACS), 2011 2nd National Conference on
Conference_Location :
Shillong
Print_ISBN :
978-1-4244-9578-8
DOI :
10.1109/NCETACS.2011.5751385