Title :
Discrimination prevention in data mining for intrusion and crime detection
Author :
Hajian, Sara ; Domingo-Ferrer, Josep ; Martínez-Ballesté, Antoni
Author_Institution :
Dept. of Comput. Eng. & Math., Univ. Rovira i Virgili, Tarragona, Spain
Abstract :
Automated data collection has fostered the use of data mining for intrusion and crime detection. Indeed, banks, large corporations, insurance companies, casinos, etc. are increasingly mining data about their customers or employees in view of detecting potential intrusion, fraud or even crime. Mining algorithms are trained from datasets which may be biased in what regards gender, race, religion or other attributes. Furthermore, mining is often outsourced or carried out in cooperation by several entities. For those reasons, discrimination concerns arise. Potential intrusion, fraud or crime should be inferred from objective misbehavior, rather than from sensitive attributes like gender, race or religion. This paper discusses how to clean training datasets and outsourced datasets in such a way that legitimate classification rules can still be extracted but discriminating rules based on sensitive attributes cannot.
Keywords :
data mining; public administration; automated data collection; crime detection; data mining; discrimination prevention; fraud detection; potential intrusion detection; Computer security; Data mining; Data models; Decision making; Itemsets; Training data;
Conference_Titel :
Computational Intelligence in Cyber Security (CICS), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-9905-2
DOI :
10.1109/CICYBS.2011.5949405