• DocumentCode
    2745337
  • Title

    Information loss evaluation based on fuzzy and crisp clustering of graph statistics

  • Author

    Nettleton, David F.

  • Author_Institution
    Data Privacy Res. Group, Univ. Pompeu Fabra, Bellaterra, Spain
  • fYear
    2012
  • fDate
    10-15 June 2012
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    In this paper we apply different types of clustering, fuzzy (fuzzy c-Means) and crisp (k-Means) to graph statistical data in order to evaluate information loss due to perturbation as part of the anonymization process for a data privacy application. We make special emphasis on two major node types: hubs, which are nodes with a high relative degree value, and bridges, which act as connecting nodes between different regions in the graph. By clustering the graph´s statistical data before and after perturbation, we can measure the change in characteristics and therefore the information loss. We partition the nodes into three groups: hubs/global bridges, local bridges, and all other nodes. We suspect that the partitions of these nodes are best represented in the fuzzy form, especially in the case of nodes in frontier regions of the graphs which may have an ambiguous assignment.
  • Keywords
    data privacy; fuzzy set theory; graph theory; pattern clustering; statistical analysis; anonymization process; crisp clustering; data privacy application; frontier regions; fuzzy c-means clustering; graph statistical data; high relative degree value; hubs-global bridges; information loss evaluation; k-means clustering; local bridges; node types; Bridges; Communities; Data privacy; Loss measurement; Perturbation methods; Social network services; bridges; clustering; crisp; data privacy; fuzzy; graphs; hubs; perturbation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems (FUZZ-IEEE), 2012 IEEE International Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    1098-7584
  • Print_ISBN
    978-1-4673-1507-4
  • Electronic_ISBN
    1098-7584
  • Type

    conf

  • DOI
    10.1109/FUZZ-IEEE.2012.6250774
  • Filename
    6250774