DocumentCode
2745337
Title
Information loss evaluation based on fuzzy and crisp clustering of graph statistics
Author
Nettleton, David F.
Author_Institution
Data Privacy Res. Group, Univ. Pompeu Fabra, Bellaterra, Spain
fYear
2012
fDate
10-15 June 2012
Firstpage
1
Lastpage
8
Abstract
In this paper we apply different types of clustering, fuzzy (fuzzy c-Means) and crisp (k-Means) to graph statistical data in order to evaluate information loss due to perturbation as part of the anonymization process for a data privacy application. We make special emphasis on two major node types: hubs, which are nodes with a high relative degree value, and bridges, which act as connecting nodes between different regions in the graph. By clustering the graph´s statistical data before and after perturbation, we can measure the change in characteristics and therefore the information loss. We partition the nodes into three groups: hubs/global bridges, local bridges, and all other nodes. We suspect that the partitions of these nodes are best represented in the fuzzy form, especially in the case of nodes in frontier regions of the graphs which may have an ambiguous assignment.
Keywords
data privacy; fuzzy set theory; graph theory; pattern clustering; statistical analysis; anonymization process; crisp clustering; data privacy application; frontier regions; fuzzy c-means clustering; graph statistical data; high relative degree value; hubs-global bridges; information loss evaluation; k-means clustering; local bridges; node types; Bridges; Communities; Data privacy; Loss measurement; Perturbation methods; Social network services; bridges; clustering; crisp; data privacy; fuzzy; graphs; hubs; perturbation;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems (FUZZ-IEEE), 2012 IEEE International Conference on
Conference_Location
Brisbane, QLD
ISSN
1098-7584
Print_ISBN
978-1-4673-1507-4
Electronic_ISBN
1098-7584
Type
conf
DOI
10.1109/FUZZ-IEEE.2012.6250774
Filename
6250774
Link To Document