Title :
Research on Privacy Preserving Distributed C4. 5 Algorithm
Author :
Shen, Yanguang ; Shao, Hui ; Huang, Jianzhong
Author_Institution :
Sch. of Inf. Sci. & Electr. Eng., Hebei Univ. of Eng., Handan, China
Abstract :
This paper studied how two parties collaboratively built a decision tree on the union of their dataset without revealing privacy when dataset is vertically and horizontally distributed. We gave an algorithm of privacy preserving C4.5 which is applicable to vertically and horizontally partitioned dataset, and also gave the detailed computation method of the information gain ratio in the case of without revealing privacy. The secure scalar product protocol, the xln(x) protocol and the secure sum protocol are used in collaborative computing, which can protect privacy effectively.
Keywords :
cryptographic protocols; data privacy; decision trees; groupware; collaborative computing; decision tree; horizontally partitioned dataset; information gain ratio; privacy preserving distributed C4.5 algorithm; secure scalar product protocol; secure sum protocol; vertically partitioned dataset; xln(x) protocol; Concrete; Data engineering; Data mining; Data privacy; Decision trees; Information science; Information technology; Partitioning algorithms; Protection; Protocols; C4.5 decision tree; distributed data mining; privacy preserving; secure multiparty calculation;
Conference_Titel :
Intelligent Information Technology Application Workshops, 2009. IITAW '09. Third International Symposium on
Conference_Location :
Nanchang
Print_ISBN :
978-1-4244-6420-3
Electronic_ISBN :
978-1-4244-6421-0
DOI :
10.1109/IITAW.2009.81