DocumentCode
570221
Title
Simultaneous pattern and data hiding in supervised learning
Author
Pengpeng Lin ; Jun Zhang ; Xiwei Wang ; Shindhelm, A.
Author_Institution
Comput. Sci. Dept., Univ. of Kentucky, Lexington, KY, USA
fYear
2012
fDate
8-10 Aug. 2012
Firstpage
385
Lastpage
392
Abstract
The ability to hide private data and confidential patterns from potential adversaries while still maintaining data mining value is an important aspect in privacy preserving data mining. In this paper, we study a nonnegative matrix factorization technique, where we show how to define objective functions and derive corresponding multiplicative update functions. We then use that knowledge to propose a data value perturbation scheme that hides data values but still keeps the data pattern to a large degree. Based on the proposed data value perturbation scheme, we develop a dual data hiding scheme which not only hides data but also hides individual sample´s class membership. The essential idea is to use an indicator matrix as a guide for the update process. The performance of the proposed schemes are examined on benchmark datasets for both utility value and data perturbation degree. The empirical results show that the data values are well perturbed and our schemes are capable of hiding a data sample´s class membership without side effects. At the end, we draw some interesting conclusions and layout potential future work.
Keywords
data encapsulation; data mining; data privacy; learning (artificial intelligence); matrix decomposition; perturbation techniques; benchmark datasets; class membership; confidential patterns; data mining value; data pattern; data perturbation degree; data value perturbation scheme; dual data hiding scheme; indicator matrix; multiplicative update functions; nonnegative matrix factorization technique; objective functions; privacy preserving data mining; private data hiding; simultaneous pattern; supervised learning; update process; utility value; Clustering algorithms; Convergence; Data privacy; Linear programming; Matrix decomposition; USA Councils; Vectors; Classification; Indicator Matrix; NMF; PPDM;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Reuse and Integration (IRI), 2012 IEEE 13th International Conference on
Conference_Location
Las Vegas, NV
Print_ISBN
978-1-4673-2282-9
Electronic_ISBN
978-1-4673-2283-6
Type
conf
DOI
10.1109/IRI.2012.6303035
Filename
6303035
Link To Document