• DocumentCode
    3092537
  • Title

    Towards Identify Anonymization in Large Survey Rating Data

  • Author

    Sun, Xiaoxun ; Wang, Hua

  • Author_Institution
    Dept. of Math. & Comput., Univ. of Southern Queensland, Toowoomba, QLD, Australia
  • fYear
    2010
  • fDate
    1-3 Sept. 2010
  • Firstpage
    99
  • Lastpage
    104
  • Abstract
    We study the challenge of identity protection in the large public survey rating data. Even though the survey participants do not reveal any of their ratings, their survey records are potentially identifiable by using information from other public sources. None of the existing anonymisation principles (e.g., k-anonymity, l-diversity, etc.) can effectively prevent such breaches in large survey rating data sets. In this paper, we tackle the problem by defining the (k, ∈)-anonymity principle. The principle requires for each transaction t in the given survey rating data T, at least (k - 1) other transactions in T must have ratings similar with t, where the similarity is controlled by e. We propose a greedy approach to anonymize survey rating data and apply the method to two real-life data sets to demonstrate their efficiency and practical utility.
  • Keywords
    greedy algorithms; security of data; very large databases; anonymisation principles; greedy approach; identity protection; large survey rating data; Accuracy; Data models; Data privacy; Hamming distance; Motion pictures; Publishing; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Network and System Security (NSS), 2010 4th International Conference on
  • Conference_Location
    Melbourne, VIC
  • Print_ISBN
    978-1-4244-8484-3
  • Electronic_ISBN
    978-0-7695-4159-4
  • Type

    conf

  • DOI
    10.1109/NSS.2010.11
  • Filename
    5636092