Title : 
Non-linear Dimensionality Reduction for Privacy-Preserving Data Classification
         
        
            Author : 
AlOtaibi, Khaled ; Rayward-Smith, V.J. ; Wenjia Wang ; de la Iglesia, B.
         
        
            Author_Institution : 
Sch. of Comput. Sci., Univ. of East Anglia, Norwich, UK
         
        
        
        
        
        
            Abstract : 
Many techniques have been proposed to protect the privacy of data outsourced for analysis by external parties. However, most of these techniques distort the underlying data properties, and therefore, hinder data mining algorithms from discovering patterns. The aim of Privacy-Preserving Data Mining (PPDM) is to generate a data-friendly transformation that maintains both the privacy and the utility of the data. We have proposed a novel privacy-preserving framework based on non-linear dimensionality reduction (i.e. non-metric multidimensional scaling) to perturb the original data. The perturbed data exhibited good utility in terms of distance-preservation between objects. This was tested on a clustering task with good results. In this paper, we test our novel PPDM approach on a classification task using a k-Nearest Neighbour (k-NN) classification algorithm. We compare the classification results obtained from both the original and the perturbed data and find them to be much same particularly for the few lower dimensions. We show that, for distance-based classification, our approach preserves the utility of the data while hiding the private details.
         
        
            Keywords : 
data mining; data privacy; data reduction; pattern classification; PPDM approach; data mining algorithm; data properties; data utility; data-friendly transformation; distance preservation; distance-based classification; external parties; k-NN classification algorithm; k-nearest neighbour classification algorithm; nonlinear dimensionality reduction; nonmetric multidimensional scaling; outsourcing; privacy-preserving data classification; Additives; Algorithm design and analysis; Data privacy; Noise; Stress; Uncertainty; Data Perturbation; Multidimensional Scaling; Privacy-Preserving Data Mining; k-NN Classification;
         
        
        
        
            Conference_Titel : 
Privacy, Security, Risk and Trust (PASSAT), 2012 International Conference on and 2012 International Confernece on Social Computing (SocialCom)
         
        
            Conference_Location : 
Amsterdam
         
        
            Print_ISBN : 
978-1-4673-5638-1
         
        
        
            DOI : 
10.1109/SocialCom-PASSAT.2012.76