• DocumentCode
    1458431
  • Title

    Service-Oriented Architecture for High-Dimensional Private Data Mashup

  • Author

    Fung, Benjamin C M ; Trojer, Thomas ; Hung, Patrick C K ; Xiong, Li ; Al-Hussaeni, Khalil ; Dssouli, Rachida

  • Author_Institution
    CIISE, Concordia Univ., Montreal, QC, Canada
  • Volume
    5
  • Issue
    3
  • fYear
    2012
  • Firstpage
    373
  • Lastpage
    386
  • Abstract
    Mashup is a web technology that allows different service providers to flexibly integrate their expertise and to deliver highly customizable services to their customers. Data mashup is a special type of mashup application that aims at integrating data from multiple data providers depending on the user´s request. However, integrating data from multiple sources brings about three challenges: 1) Simply joining multiple private data sets together would reveal the sensitive information to the other data providers. 2) The integrated (mashup) data could potentially sharpen the identification of individuals and, therefore, reveal their person-specific sensitive information that was not available before the mashup. 3) The mashup data from multiple sources often contain many data attributes. When enforcing a traditional privacy model, such as K-anonymity, the high-dimensional data would suffer from the problem known as the curse of high dimensionality, resulting in useless data for further data analysis. In this paper, we study and resolve a privacy problem in a real-life mashup application for the online advertising industry in social networks, and propose a service-oriented architecture along with a privacy-preserving data mashup algorithm to address the aforementioned challenges. Experiments on real-life data suggest that our proposed architecture and algorithm is effective for simultaneously preserving both privacy and information utility on the mashup data. To the best of our knowledge, this is the first work that integrates high-dimensional data for mashup service.
  • Keywords
    Internet; advertising; data analysis; data integration; data privacy; service-oriented architecture; social networking (online); K-anonymity; Web technology; data analysis; data attributes; data integration; high dimensionality curse; high-dimensional data; high-dimensional private data mashup; multiple private data sets; online advertising industry; person-specific sensitive information; privacy model; privacy-preserving data mashup algorithm; service-oriented architecture; social networks; Companies; Couplings; Data models; Data privacy; Mashups; Privacy; Social network services; Privacy protection; anonymity; data integration; data mashup; high dimensionality.; service-oriented architecture;
  • fLanguage
    English
  • Journal_Title
    Services Computing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1939-1374
  • Type

    jour

  • DOI
    10.1109/TSC.2011.13
  • Filename
    5719598