• DocumentCode
    1985704
  • Title

    Privacy Preserving Data Publishing for Recommender System

  • Author

    Chen, Xiaoqiang ; Huang, Vincent

  • Author_Institution
    Dept. of Inf. Technol., Uppsala Univ., Uppsala, Sweden
  • fYear
    2012
  • fDate
    16-20 July 2012
  • Firstpage
    128
  • Lastpage
    133
  • Abstract
    Driven by mutual benefits, exchange and publication of data among various parties is an inevitable trend. However, released data often contains sensitive user information thus direct publication violates individual privacy. Among many privacy models, k-anonymity framework is popular and well-studied, it protects information by constructing groups of anonymous records such that each record in the table released is covered by no fewer than k-1 other records. In this paper, we first investigate different privacy preserving technologies and then focus on achieving k-anonymity for large scale and sparse databases, especially recommender systems. We present a general process for anonymization of large scale database. A preprocessing phase strategically extracts preference matrix from original data by Singular Value Decomposition (SVD) and eliminates the high dimensionality and sparsity problem. We developed a new clustering based k-anonymity heuristic named Bisecting K-Gather (BKG) and it is proven to be efficient and accurate. To support customized user privacy assignments, we also proposed a new concept called customized k-anonymity along with a corresponding algorithm (BOKG). We use MovieLens database to assess our algorithms. The results show that we can efficiently release anonymized data without compromising the utility of data.
  • Keywords
    data privacy; database management systems; electronic data interchange; matrix algebra; pattern clustering; publishing; recommender systems; singular value decomposition; BKG; MovieLens database; SVD; anonymous records; bisecting k-gather; clustering based k-anonymity heuristic; customized user privacy assignments; data exchange; individual privacy; k-anonymity framework; large scale database; preference matrix; privacy preserving data publishing; recommender system; sensitive user information; singular value decomposition; sparse databases; Clustering algorithms; Data privacy; Databases; Feature extraction; Motion pictures; Privacy; Publishing; Bisecting K-Gather; Bisecting One-K-Gather; Customized K-Anonymity; K-Anonymity; Privacy Preserving Data Publishing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Software and Applications Conference Workshops (COMPSACW), 2012 IEEE 36th Annual
  • Conference_Location
    Izmir
  • Print_ISBN
    978-1-4673-2714-5
  • Electronic_ISBN
    978-0-7695-4758-9
  • Type

    conf

  • DOI
    10.1109/COMPSACW.2012.33
  • Filename
    6341563