• DocumentCode
    2141738
  • Title

    A framework for privacy-preserving cluster analysis

  • Author

    Fung, Benjamin C M ; Wang, Ke ; Wang, Lingyu ; Debbabi, Mourad

  • Author_Institution
    CIISE, Concordia Univ., Montreal, QC
  • fYear
    2008
  • fDate
    17-20 June 2008
  • Firstpage
    46
  • Lastpage
    51
  • Abstract
    Releasing person-specific data could potentially reveal sensitive information of individuals. k-anonymization is a promising privacy protection mechanism in data publishing. Though substantial research has been conducted on k-anonymization and its extensions in recent years, few of them consider releasing data for a specific purpose of data analysis. This paper presents a practical data publishing framework for determining a generalized version of data that preserves both individual privacy and information usefulness for cluster analysis. Experiments on real-life data suggest that, by focusing on preserving cluster structure in the generalization process, the cluster quality is significantly better than the cluster quality on the generalized data without such focus. The major challenge of generalizing data for cluster analysis is the lack of class labels that could be used to guide the generalization process. Our approach converts the problem into the counterpart problem for classification analysis where class labels encode the cluster structure in the data, and presents a framework to evaluate the cluster quality on the generalized data.
  • Keywords
    data analysis; data privacy; pattern clustering; statistical analysis; classification analysis; data analysis; data publishing; generalization process; k-anonymization; privacy protection mechanism; privacy-preserving cluster analysis; Data analysis; Data privacy; Diseases; Government; Information analysis; Information security; Internet; National security; Protection; Terrorism;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligence and Security Informatics, 2008. ISI 2008. IEEE International Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    978-1-4244-2414-6
  • Electronic_ISBN
    978-1-4244-2415-3
  • Type

    conf

  • DOI
    10.1109/ISI.2008.4565028
  • Filename
    4565028