Title :
Algorithms for balancing privacy and knowledge discovery in association rule mining
Author :
Oliveira, Stanley R M ; Zaïane, Osmar R.
Author_Institution :
Embrapa Informatica Agropecuaria, Campinas, Brazil
Abstract :
The discovery of association rules from large databases has proven beneficial for companies since such rules can be very effective in revealing actionable knowledge that leads to strategic decisions. In tandem with this benefit, association rule mining can also pose a threat to privacy protection. The main problem is that from non-sensitive information or unclassified data, one is able to infer sensitive information, including personal information, facts, or even patterns that are not supposed to be disclosed. This scenario reveals a pressing need for techniques that ensure privacy protection, while facilitating proper information accuracy and mining. In this paper, we introduce new algorithms for balancing privacy and knowledge discovery in association rule mining. We show that our algorithms require only two scans, regardless of the database size and the number of restrictive association rules that must be protected. Our performance study compares the effectiveness and scalability of the proposed algorithms and analyzes the fraction of association rules, which are preserved after sanitizing a database. We also report the main results of our performance evaluation and discuss some open research issues.
Keywords :
data analysis; data mining; data privacy; very large databases; association rule mining; data mining technology; database sanitation; database scanning; database size; information accuracy; knowledge discovery; large database; performance evaluation; privacy preservation; privacy protection; restrictive association rule protection; sensitive information; strategic decision; unclassified data; Association rules; Data mining; Data privacy; Information security; Pattern analysis; Performance analysis; Pressing; Protection; Scalability; Transaction databases;
Conference_Titel :
Database Engineering and Applications Symposium, 2003. Proceedings. Seventh International
Print_ISBN :
0-7695-1981-4
DOI :
10.1109/IDEAS.2003.1214911