Title :
Privacy-Preserving k-NN for Small and Large Data Sets
Author :
Amirbekyan, Artak
Author_Institution :
Griffith Univ., Meadowbrook
Abstract :
It is not surprising that there is strong interest in k- NN queries to enable clustering, classification and outlier- detection tasks. However, previous approaches to privacy-preserving k-NN are costly and can only be realistically applied to small data sets. We provide efficient solutions for k-NN queries queries for vertically partitioned data. We provide the first solution for the Linfin (or Chessboard) metric as well as detailed privacy-preserving computation of all other Minkowski metrics. We enable privacy-preserving Linfin by providing a solution to the Yao´s Millionaire Problem with more than two parties. This is based on a new and practical solution to Yao´s Millionaire with shares. We also provide privacy-preserving algorithms for combinations of local metrics into a global that handles the large dimensionality and diversity of attributes common in vertically partitioned data.
Keywords :
data handling; data mining; data privacy; pattern classification; pattern clustering; Minkowski metrics; Yao millionaire problem; attribute diversity; chessboard metric; data dimensionality; data set privacy preservation; k-NN queries; outlier detection; pattern classification; pattern clustering; vertically partitioned data; Australia; Circuits; Collaboration; Data mining; Data privacy; Databases; Neural networks; Partitioning algorithms; Polynomials; Sliding mode control;
Conference_Titel :
Data Mining Workshops, 2007. ICDM Workshops 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
Print_ISBN :
978-0-7695-3019-2
Electronic_ISBN :
978-0-7695-3033-8
DOI :
10.1109/ICDMW.2007.67