مرکز منطقه ای اطلاع رساني علوم و فناوري - Comparison of microaggregation approaches on anonymized data quality

Title of article :

Comparison of microaggregation approaches on anonymized data quality

Author/Authors :

Lin، نويسنده , , Jun-Lin and Chang، نويسنده , , Pei-Chann and Liu، نويسنده , , Julie Yu-Chih and Wen، نويسنده , , Tsung-Hsien، نويسنده ,

Issue Information :

روزنامه با شماره پیاپی سال 2010

Pages :

From page :

8161

To page :

8165

Abstract :

Microaggregation is commonly used to protect microdata from individual identification by anonymizing dataset records such that the resulting dataset (called the anonymized dataset) satisfies the k-anonymity constraint. Since this anonymizing process degrades data quality, an effective microaggregation approach must ensure the quality of the anonymized dataset so that the anonymized dataset remains useful for further analysis. Therefore, the performance of a microaggregation approach should be measured by the quality of the anonymized dataset generated by the microaggregation approach. Previous studies often refer to the quality of an anonymized dataset as information loss. This study takes a different approach. Since an anonymized dataset should support further analysis, this study first builds a classifier from the anonymized dataset, and then uses the prediction accuracy of that classifier to represent the quality of the anonymized dataset. Performance results indicate that low information loss does not necessarily translate into high prediction accuracy, and vice versa. This is particularly true when the information losses of both anonymized datsets do not differ significantly.

Keywords :

Mircroaggregation , Disclosure control , k-anonymity , Information loss

Journal title :

Expert Systems with Applications

Serial Year :

2010

Journal title :

Expert Systems with Applications

Record number :

2348537

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=2348537