DocumentCode
1072
Title
Supporting Pattern-Preserving Anonymization for Time-Series Data
Author
Lidan Shou ; Xuan Shang ; Ke Chen ; Gang Chen ; Chao Zhang
Author_Institution
Coll. of Comput. Sci., Zhejiang Univ., Hangzhou, China
Volume
25
Issue
4
fYear
2013
fDate
Apr-13
Firstpage
877
Lastpage
892
Abstract
Time series is an important form of data available in numerous applications and often contains vast amount of personal privacy. The need to protect privacy in time-series data while effectively supporting complex queries on them poses nontrivial challenges to the database community. We study the anonymization of time series while trying to support complex queries, such as range and pattern matching queries, on the published data. The conventional k-anonymity model cannot effectively address this problem as it may suffer severe pattern loss. We propose a novel anonymization model called (k, P)-anonymity for pattern-rich time series. This model publishes both the attribute values and the patterns of time series in separate data forms. We demonstrate that our model can prevent linkage attacks on the published data while effectively support a wide variety of queries on the anonymized data. We propose two algorithms to enforce (k, P)-anonymity on time-series data. Our anonymity model supports customized data publishing, which allows a certain part of the values but a different part of the pattern of the anonymized time series to be published simultaneously. We present estimation techniques to support query processing on such customized data. The proposed methods are evaluated in a comprehensive experimental study. Our results verify the effectiveness and efficiency of our approach.
Keywords
data privacy; query processing; time series; (k,P)-anonymity; attribute values; complex queries; customized data; data anonymization; database community; k-anonymity model; linkage attack prevention; pattern matching queries; pattern-preserving anonymization; pattern-rich time series; personal privacy protection; published data; query processing; time-series data; Correlation; Couplings; Data models; Data privacy; Databases; Pattern matching; Publishing; Privacy; anonymity; pattern; time series;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2011.249
Filename
6095556
Link To Document