DocumentCode :
2741348
Title :
Privacy Preserving Clustering by Cluster Bulging for Information Sustenance
Author :
Kadampur, Mohammad Ali ; Somayajulu, D.V.L.N. ; Dhiraj, S. S Shivaji ; Satyam, Shailesh G P
Author_Institution :
Dept. of Comput. Sci. & Eng., Nat. Inst. of Technol., Warangal
fYear :
2008
fDate :
12-14 Dec. 2008
Firstpage :
240
Lastpage :
246
Abstract :
Cluster analysis is a data mining approach for unsupervised learning. However, the use of clustering as a data mining tool has been a cause of growing concern as the use of this technology is violating individual privacy. This paper presents a method for privacy preserving clustering through cluster bulging. In this method, the objects of the database are first aligned into clusters based on a similarity measure. The data in these clusters is perturbed in a controlled manner by modifying the values of various objects, so that, in the perturbed data set, the clusters are bulged in comparison to those in the original data set. In order to perform this perturbation, every cluster is displaced along the line joining its centroid to the centroid of the whole data set. And, then, every object in each cluster is shifted along the line joining that object to the centroid of the cluster. The word bulging used here refers to both positive and negative bulging. The method in essence manipulates the similarity measures and recomputes the new perturbed objects of the respective clusters. Thus, every object in the bulged cluster represents its corresponding object from the original cluster. After the application of this method, the objects get perturbed, while the number of member objects and shape of each cluster remain the same as those of the original clusters, thereby the information in the two instances of the data sets is sustained, while, the privacy of sensitive data is preserved.
Keywords :
data mining; data privacy; database management systems; unsupervised learning; cluster analysis; cluster bulging; data mining; database objects; information sustenance; perturbation; perturbed objects; privacy preserving clustering; sensitive data; similarity measure; unsupervised learning; Algorithm design and analysis; Clustering algorithms; Computer science; Data analysis; Data mining; Data privacy; Databases; Information analysis; Shape; Unsupervised learning; cluster analysis; data mining; data perturbation; information revealing; privacy preservation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Automation for Sustainability, 2008. ICIAFS 2008. 4th International Conference on
Conference_Location :
Colombo
Print_ISBN :
978-1-4244-2899-1
Electronic_ISBN :
978-1-4244-2900-4
Type :
conf
DOI :
10.1109/ICIAFS.2008.4783947
Filename :
4783947
Link To Document :
بازگشت