• DocumentCode
    2918857
  • Title

    Addressing Accuracy Issues in Privacy Preserving Data Mining through Matrix Factorization

  • Author

    Wang, Jie ; Zhang, Jun

  • Author_Institution
    Univ. of Kentucky, Lexington
  • fYear
    2007
  • fDate
    23-24 May 2007
  • Firstpage
    217
  • Lastpage
    220
  • Abstract
    Maintaining data mining accuracy on distorted datasets is an important issue in privacy preserving data mining. Using matrix approximation, we propose several efficient and flexible techniques to address this issue, and utilize unique characteristics of matrix factorization to maintain data pattern. We use the support vector machine classification to compare accuracy maintenance after data distortion by different methods. With better performance than some classical data perturbation approaches, nonnegative matrix factorization and singular value decomposition are considered to be promising techniques for privacy preserving data mining. Experimental results demonstrate that mining accuracy on the distorted data used these methods is almost as good as that on the original data, with added property of privacy preservation. It indicates that the matrix factorization-based data distortion schemes perturb only confidential attributes to meet privacy requirements while preserving general data pattern for knowledge extraction.
  • Keywords
    data mining; data privacy; singular value decomposition; support vector machines; knowledge extraction; matrix approximation; matrix factorization-based data distortion scheme; privacy preserving data mining; singular value decomposition; support vector machine classification; Additive noise; Classification algorithms; Computer simulation; Data mining; Data privacy; Laboratories; Matrix decomposition; Scientific computing; Support vector machine classification; Support vector machines; data mining; matrix factorization; nonnegative matrix factorization; privacy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligence and Security Informatics, 2007 IEEE
  • Conference_Location
    New Brunswick, NJ
  • Electronic_ISBN
    1-4244-1329-X
  • Type

    conf

  • DOI
    10.1109/ISI.2007.379474
  • Filename
    4258700