DocumentCode
2809503
Title
On Data Distortion for Privacy Preserving Data Mining
Author
Kabir, Saif M A ; Youssef, Amr M. ; Elhakeem, Ahmed K.
Author_Institution
Concordia Univ., Montreal
fYear
2007
fDate
22-26 April 2007
Firstpage
308
Lastpage
311
Abstract
Because of the increasing ability to trace and collect large amount of personal information, privacy preserving in data mining applications has become an important concern. Data perturbation is one of the well known techniques for privacy preserving data mining. The objective of these data perturbation techniques is to distort the individual data values while preserving the underlying statistical distribution properties. Theses data perturbation techniques are usually assessed in terms of both their privacy parameters as well as its associated utility measure. While the privacy parameters present the ability of these techniques to hide the original data values, the data utility measures assess whether the dataset keeps the performance of data mining techniques after the data distortion. In this paper, we investigate the use of truncated non-negative matrix factorization (NMF) with sparseness constraints for data perturbation.
Keywords
data mining; data privacy; matrix decomposition; security of data; sparse matrices; statistical distributions; data distortion; data perturbation; data utility measure; personal information; privacy preserving data mining; sparseness constraints; statistical distribution; truncated nonnegative matrix factorization; Application software; Data engineering; Data mining; Data privacy; Distortion measurement; Euclidean distance; Information systems; Perturbation methods; Statistical distributions; Systems engineering and theory;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical and Computer Engineering, 2007. CCECE 2007. Canadian Conference on
Conference_Location
Vancouver, BC
ISSN
0840-7789
Print_ISBN
1-4244-1020-7
Electronic_ISBN
0840-7789
Type
conf
DOI
10.1109/CCECE.2007.83
Filename
4232742
Link To Document