DocumentCode
1985704
Title
Privacy Preserving Data Publishing for Recommender System
Author
Chen, Xiaoqiang ; Huang, Vincent
Author_Institution
Dept. of Inf. Technol., Uppsala Univ., Uppsala, Sweden
fYear
2012
fDate
16-20 July 2012
Firstpage
128
Lastpage
133
Abstract
Driven by mutual benefits, exchange and publication of data among various parties is an inevitable trend. However, released data often contains sensitive user information thus direct publication violates individual privacy. Among many privacy models, k-anonymity framework is popular and well-studied, it protects information by constructing groups of anonymous records such that each record in the table released is covered by no fewer than k-1 other records. In this paper, we first investigate different privacy preserving technologies and then focus on achieving k-anonymity for large scale and sparse databases, especially recommender systems. We present a general process for anonymization of large scale database. A preprocessing phase strategically extracts preference matrix from original data by Singular Value Decomposition (SVD) and eliminates the high dimensionality and sparsity problem. We developed a new clustering based k-anonymity heuristic named Bisecting K-Gather (BKG) and it is proven to be efficient and accurate. To support customized user privacy assignments, we also proposed a new concept called customized k-anonymity along with a corresponding algorithm (BOKG). We use MovieLens database to assess our algorithms. The results show that we can efficiently release anonymized data without compromising the utility of data.
Keywords
data privacy; database management systems; electronic data interchange; matrix algebra; pattern clustering; publishing; recommender systems; singular value decomposition; BKG; MovieLens database; SVD; anonymous records; bisecting k-gather; clustering based k-anonymity heuristic; customized user privacy assignments; data exchange; individual privacy; k-anonymity framework; large scale database; preference matrix; privacy preserving data publishing; recommender system; sensitive user information; singular value decomposition; sparse databases; Clustering algorithms; Data privacy; Databases; Feature extraction; Motion pictures; Privacy; Publishing; Bisecting K-Gather; Bisecting One-K-Gather; Customized K-Anonymity; K-Anonymity; Privacy Preserving Data Publishing;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Software and Applications Conference Workshops (COMPSACW), 2012 IEEE 36th Annual
Conference_Location
Izmir
Print_ISBN
978-1-4673-2714-5
Electronic_ISBN
978-0-7695-4758-9
Type
conf
DOI
10.1109/COMPSACW.2012.33
Filename
6341563
Link To Document