• DocumentCode
    570221
  • Title

    Simultaneous pattern and data hiding in supervised learning

  • Author

    Pengpeng Lin ; Jun Zhang ; Xiwei Wang ; Shindhelm, A.

  • Author_Institution
    Comput. Sci. Dept., Univ. of Kentucky, Lexington, KY, USA
  • fYear
    2012
  • fDate
    8-10 Aug. 2012
  • Firstpage
    385
  • Lastpage
    392
  • Abstract
    The ability to hide private data and confidential patterns from potential adversaries while still maintaining data mining value is an important aspect in privacy preserving data mining. In this paper, we study a nonnegative matrix factorization technique, where we show how to define objective functions and derive corresponding multiplicative update functions. We then use that knowledge to propose a data value perturbation scheme that hides data values but still keeps the data pattern to a large degree. Based on the proposed data value perturbation scheme, we develop a dual data hiding scheme which not only hides data but also hides individual sample´s class membership. The essential idea is to use an indicator matrix as a guide for the update process. The performance of the proposed schemes are examined on benchmark datasets for both utility value and data perturbation degree. The empirical results show that the data values are well perturbed and our schemes are capable of hiding a data sample´s class membership without side effects. At the end, we draw some interesting conclusions and layout potential future work.
  • Keywords
    data encapsulation; data mining; data privacy; learning (artificial intelligence); matrix decomposition; perturbation techniques; benchmark datasets; class membership; confidential patterns; data mining value; data pattern; data perturbation degree; data value perturbation scheme; dual data hiding scheme; indicator matrix; multiplicative update functions; nonnegative matrix factorization technique; objective functions; privacy preserving data mining; private data hiding; simultaneous pattern; supervised learning; update process; utility value; Clustering algorithms; Convergence; Data privacy; Linear programming; Matrix decomposition; USA Councils; Vectors; Classification; Indicator Matrix; NMF; PPDM;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration (IRI), 2012 IEEE 13th International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    978-1-4673-2282-9
  • Electronic_ISBN
    978-1-4673-2283-6
  • Type

    conf

  • DOI
    10.1109/IRI.2012.6303035
  • Filename
    6303035