Title :
Privacy Preserving Nearest Neighbor Search
Author :
Shaneck, Mark ; Kim, Yongdae ; Kumar, Vipin
Author_Institution :
Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN
Abstract :
Data mining is frequently obstructed by privacy concerns. In many cases data is distributed, and bringing the data together in one place for analysis is not possible due to privacy laws (e.g. HIPAA) or policies. Privacy preserving data mining techniques have been developed to address this issue by providing mechanisms to mine the data while giving certain privacy guarantees. In this work we address the issue of privacy preserving nearest neighbor search, which forms the kernel of many data mining applications. To this end, we present a novel algorithm based on secure multiparty computation primitives to compute the nearest neighbors of records in horizontally distributed data. We show how this algorithm can be used in three important data mining algorithms, namely LOF outlier detection, SNN clustering, and kNN classification
Keywords :
data mining; data privacy; pattern clustering; LOF outlier detection; SNN clustering; data mining; kNN classification; multiparty computation primitives; nearest neighbor search; privacy preservation; Clustering algorithms; Computer science; Conferences; Cryptography; Data mining; Data privacy; Distributed computing; Kernel; Medical diagnostic imaging; Nearest neighbor searches;
Conference_Titel :
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2702-7
DOI :
10.1109/ICDMW.2006.133