• DocumentCode
    3252425
  • Title

    A privacy preserving Jaccard similarity function for mining encrypted data

  • Author

    Singh, Meena Dilip ; Krishna, Radha P. ; Saxena, Ashutosh

  • Author_Institution
    SETLabs, Infosys Technol. Ltd., Bangalore, India
  • fYear
    2009
  • fDate
    23-26 Jan. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Due to advances in data collection and increasing dependency on data mining experts, preserving privacy of the data is a major concern when mining the data. Most of the classifier implementations for data mining have the tradeoff between classification accuracy and maintenance of data privacy. Another important aspect in distance-based classifiers is to accurately compute distance (or similarity) between two or more data points. In privacy preserving data mining techniques, providing a suitable distance measure to classify the data while maintaining data privacy is a challenging task. In this paper, we present an approach to compute similarity between two encrypted data points. We augmented Jaccard similarity function with Private Equality Test protocol facilitating a semi honest third party to conduct the equality test. The proposed privacy preserving scheme provides an efficient mechanism for similarity computation with reduced communication cost for mining the data.
  • Keywords
    cryptography; data mining; data privacy; classifier implementations; encrypted data mining; encrypted data points; privacy preserving jaccard similarity function; private equality test protocol; similarity computation; Clustering algorithms; Costs; Cryptographic protocols; Cryptography; Data mining; Data privacy; Data security; Databases; Testing; Usability; Cryptography; Encrypted Data; Privacy Preserving Data Mining; Private Equality Test;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    TENCON 2009 - 2009 IEEE Region 10 Conference
  • Conference_Location
    Singapore
  • Print_ISBN
    978-1-4244-4546-2
  • Electronic_ISBN
    978-1-4244-4547-9
  • Type

    conf

  • DOI
    10.1109/TENCON.2009.5395869
  • Filename
    5395869