DocumentCode :
2369197
Title :
On the privacy preserving properties of random data perturbation techniques
Author :
Kargupta, Hillol ; Datta, Souptik ; Wang, Qi ; Sivakumar, Krishnamoorthy
Author_Institution :
Dept. of Comput. Sci. & Electr. Eng., Univ. of Maryland Baltimore County, MD, USA
fYear :
2003
fDate :
19-22 Nov. 2003
Firstpage :
99
Lastpage :
106
Abstract :
Privacy is becoming an increasingly important issue in many data mining applications. This has triggered the development of many privacy-preserving data mining techniques. A large fraction of them use randomized data distortion techniques to mask the data for preserving the privacy of sensitive data. This methodology attempts to hide the sensitive data by randomly modifying the data values often using additive noise. We question the utility of the random value distortion technique in privacy preservation. We note that random objects (particularly random matrices) have "predictable" structures in the spectral domain and it develops a random matrix-based spectral filtering technique to retrieve original data from the dataset distorted by adding random values. We present the theoretical foundation of this filtering method and extensive experimental results to demonstrate that in many cases random data distortion preserve very little data privacy. We also point out possible avenues for the development of new privacy-preserving data mining techniques like exploiting multiplicative and colored noise for preserving privacy in data mining applications.
Keywords :
data mining; data privacy; perturbation techniques; random noise; colored noise; data perturbation technique; data privacy; matrix-based spectral filtering technique; multiplicative noise; privacy-preserving data mining technique; random noise; random objects; randomized data distortion technique; Additive noise; Application software; Colored noise; Computer science; Data mining; Data privacy; Filtering; Information retrieval; Perturbation methods; Telecommunication traffic;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
Type :
conf
DOI :
10.1109/ICDM.2003.1250908
Filename :
1250908
Link To Document :
بازگشت