• DocumentCode
    72427
  • Title

    Anonymizing Collections of Tree-Structured Data

  • Author

    Gkountouna, Olga ; Terrovitis, Manolis

  • Author_Institution
    Dept. of Electr. & Comput. Eng., NTUA, Greece
  • Volume
    27
  • Issue
    8
  • fYear
    2015
  • fDate
    Aug. 1 2015
  • Firstpage
    2034
  • Lastpage
    2048
  • Abstract
    Collections of real-world data usually have implicit or explicit structural relations. For example, databases link records through foreign keys, and XML documents express associations between different values through syntax. Privacy preservation, until now, has focused either on data with a very simple structure, e.g. relational tables, or on data with very complex structure e.g. social network graphs, but has ignored intermediate cases, which are the most frequent in practice. In this work, we focus on tree structured data. Such data stem from various applications, even when the structure is not directly reflected in the syntax, e.g. XML documents. A characteristic case is a database where information about a single person is scattered amongst different tables that are associated through foreign keys. The paper defines k(m;n)-anonymity, which provides protection against identity disclosure and proposes a greedy anonymization heuristic that is able to sanitize large datasets. The algorithm and the quality of the anonymization are evaluated experimentally.
  • Keywords
    XML; data privacy; tree data structures; XML documents; data stem; greedy anonymization heuristic; privacy preservation; relational tables; social network graphs; tree-structured data; Data engineering; Data privacy; Diseases; Hospitals; Lungs; Privacy; Privacy; anonymity; disassociation; generalization; structural knowledge; tree data;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2015.2405563
  • Filename
    7045589