DocumentCode :
2190861
Title :
Privacy preserving processing of data decision tree based on sample selection and Singular Value Decomposition
Author :
Jain, Paril ; Pathak, Nagendra ; Tapashetti, Pratibha ; Umesh, A.S.
Author_Institution :
Dept. of CSE, AISECT Univ., Bhopal, India
fYear :
2013
fDate :
4-6 Dec. 2013
Firstpage :
91
Lastpage :
95
Abstract :
Data mining is a set of automated techniques used to extract hidden or buried information from large databases. With the development of data mining technologies, privacy protection has become a challenge for data mining applications in many fields. To solve this problem, many privacy-preserving data mining methods have been proposed. One important type of such methods is based on Singular Value Decomposition (SVD). In the proposed algorithm, attributes are grouped according to their distance difference similarity by clustering the data set using decision tree classification. Secondly, the algorithm packetizes the attributes according to their SA value in each group. Thirdly, for each group it selects attributes from the smallest bucket and searches for a similar attributes in the attributes-1 largest buckets from the same group to create an equivalence class following the unique attribute-distinct diversity anonymization model. The proposed algorithm satisfies the “utility based anonymization principle that crucial information is protected from being suppressed. Also, weights given to attributes improve clustering and give the ability to control the generalization´s depth. In prototype decision tree is combination of clustering and classification technique such methods are called ensemble classifier, this new proposed method is more efficient in balancing data privacy and data utility.
Keywords :
data mining; data privacy; database management systems; decision trees; singular value decomposition; SVD; attributes-1 largest buckets; automated techniques; buried information extraction; data decision tree; data privacy; data set clustering; data utility; decision tree classification; ensemble classifier; hidden information extraction; large databases; privacy preserving processing; privacy-preserving data mining methods; sample selection; singular value decomposition; unique attribute-distinct diversity anonymization model; utility based anonymization principle; Accuracy; Data privacy; Distortion measurement; Manganese; Clustering; Decision Tree; Privacy-Preserving Data Mining; Singular Value Decomposition (SVD);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Assurance and Security (IAS), 2013 9th International Conference on
Conference_Location :
Gammarth
Print_ISBN :
978-1-4799-2989-4
Type :
conf
DOI :
10.1109/ISIAS.2013.6947739
Filename :
6947739
Link To Document :
بازگشت