Title :
Privacy Preserving Collaborative Data Mining
Author_Institution :
Carnegie Mellon Univ., Pittsburgh
Abstract :
Data mining and knowledge discovery in databases are important research areas that investigate the automatic extraction of previously unknown patterns from large amounts of data. The field connects the three worlds of databases, artificial intelligence and statistics. However, the usefulness of this data is negligible if meaningful information or knowledge cannot be extracted from it. Data mining and knowledge discovery attempts to answer this need. Aiming at developing practical solutions to privacy-preserving data mining problems, we have applied the random perturbation technique and the randomized response technique. The idea is to add random noise to the original data so that it is hidden. In another field, the success of homeland security aiming to counter terrorism depends on a combination of strength across different mission areas, effective international collaboration and information sharing to support a coalition in which different organizations and nations must share some, but not all, information. Information privacy thus becomes extremely important and our technique can be applied. In the Internet era, collaborative data mining is becoming a popular way to extract useful knowledge from large databases.
Keywords :
Internet; data mining; data privacy; information retrieval; security of data; very large databases; Internet; collaborative data mining; data privacy; homeland security; information sharing; knowledge discovery; large database; random perturbation technique; Artificial intelligence; Counting circuits; Data mining; Data privacy; Databases; International collaboration; Internet; Perturbation methods; Statistics; Terrorism;
Conference_Titel :
Intelligence and Security Informatics, 2007 IEEE
Conference_Location :
New Brunswick, NJ
Electronic_ISBN :
1-4244-1329-X
DOI :
10.1109/ISI.2007.379472