Regional Information Center for Science and Technology (RICeST) - Privacy preserving processing of data decision tree based on sample selection and Singular Value Decomposition

DocumentCode :

2190861

Title :

Privacy preserving processing of data decision tree based on sample selection and Singular Value Decomposition

Author :

Jain, Paril ; Pathak, Nagendra ; Tapashetti, Pratibha ; Umesh, A.S.

Author_Institution :

Dept. of CSE, AISECT Univ., Bhopal, India

fYear :

2013

fDate :

4-6 Dec. 2013

Firstpage :

Lastpage :

Abstract :

Data mining is a set of automated techniques for extracting hidden or buried information from large databases. As data mining technologies have developed, privacy protection has become a challenge for data mining applications in many fields. Many privacy-preserving data mining methods have been proposed to address this problem, and one important family of such methods is based on Singular Value Decomposition (SVD). In the proposed algorithm, attributes are first grouped according to their distance difference similarity by clustering the data set using decision tree classification. Second, the algorithm places the attributes of each group into buckets according to their SA value. Third, for each group it selects attributes from the smallest bucket and searches for similar attributes in the attributes-1 largest buckets of the same group to create an equivalence class that follows the unique attribute-distinct diversity anonymization model. The proposed algorithm satisfies the “utility based anonymization principle”, under which crucial information is protected from being suppressed. In addition, weights assigned to attributes improve clustering and make it possible to control the depth of generalization. The prototype decision tree combines clustering and classification techniques; such methods are called ensemble classifiers. The proposed method is more efficient in balancing data privacy and data utility.
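
The abstract names SVD as the basis of one family of privacy-preserving methods but does not spell out how it is applied. As a rough illustration only, the sketch below shows the generic idea behind SVD-based perturbation: a data matrix is replaced by a low-rank approximation, so that individual values are distorted while the dominant structure is kept. This is not the authors' algorithm; the function name `svd_perturb`, the rank `k`, and the random test matrix are all assumptions made for the example.

```python
import numpy as np

def svd_perturb(data, k):
    """Return a rank-k approximation of `data` (rows = records, columns = attributes).

    Hypothetical helper for illustration; not taken from the paper.
    """
    # Thin SVD: data = u @ diag(s) @ vt, singular values in descending order
    u, s, vt = np.linalg.svd(data, full_matrices=False)
    # Clamp k to the valid range of singular values
    k = max(1, min(k, len(s)))
    # Keep only the k largest singular values and their vectors
    return (u[:, :k] * s[:k]) @ vt[:k, :]

# Synthetic data standing in for a real table (assumption)
rng = np.random.default_rng(0)
original = rng.normal(size=(50, 6))

perturbed = svd_perturb(original, k=3)

# Relative Frobenius-norm distortion between original and perturbed data
distortion = np.linalg.norm(original - perturbed) / np.linalg.norm(original)
print(perturbed.shape, float(distortion))
```

A smaller `k` distorts the data more (more privacy, less utility), while `k` equal to the number of attributes returns the original matrix unchanged, which is one simple way to see the privacy and utility trade-off the abstract refers to.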

Keywords :

data mining; data privacy; database management systems; decision trees; singular value decomposition; SVD; attributes-1 largest buckets; automated techniques; buried information extraction; data decision tree; data privacy; data set clustering; data utility; decision tree classification; ensemble classifier; hidden information extraction; large databases; privacy preserving processing; privacy-preserving data mining methods; sample selection; singular value decomposition; unique attribute-distinct diversity anonymization model; utility based anonymization principle; Accuracy; Data privacy; Distortion measurement; Manganese; Clustering; Decision Tree; Privacy-Preserving Data Mining; Singular Value Decomposition (SVD);

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Information Assurance and Security (IAS), 2013 9th International Conference on

Conference_Location :

Gammarth

Print_ISBN :

978-1-4799-2989-4

Type :

conf

DOI :

10.1109/ISIAS.2013.6947739

Filename :

6947739

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2190861