Title :
Service-Oriented Architecture for High-Dimensional Private Data Mashup
Author :
Fung, Benjamin C M ; Trojer, Thomas ; Hung, Patrick C K ; Xiong, Li ; Al-Hussaeni, Khalil ; Dssouli, Rachida
Author_Institution :
CIISE, Concordia Univ., Montreal, QC, Canada
Abstract :
Mashup is a web technology that allows different service providers to flexibly integrate their expertise and to deliver highly customizable services to their customers. Data mashup is a special type of mashup application that aims at integrating data from multiple data providers depending on the user´s request. However, integrating data from multiple sources brings about three challenges: 1) Simply joining multiple private data sets together would reveal the sensitive information to the other data providers. 2) The integrated (mashup) data could potentially sharpen the identification of individuals and, therefore, reveal their person-specific sensitive information that was not available before the mashup. 3) The mashup data from multiple sources often contain many data attributes. When enforcing a traditional privacy model, such as K-anonymity, the high-dimensional data would suffer from the problem known as the curse of high dimensionality, resulting in useless data for further data analysis. In this paper, we study and resolve a privacy problem in a real-life mashup application for the online advertising industry in social networks, and propose a service-oriented architecture along with a privacy-preserving data mashup algorithm to address the aforementioned challenges. Experiments on real-life data suggest that our proposed architecture and algorithm is effective for simultaneously preserving both privacy and information utility on the mashup data. To the best of our knowledge, this is the first work that integrates high-dimensional data for mashup service.
Keywords :
Internet; advertising; data analysis; data integration; data privacy; service-oriented architecture; social networking (online); K-anonymity; Web technology; data analysis; data attributes; data integration; high dimensionality curse; high-dimensional data; high-dimensional private data mashup; multiple private data sets; online advertising industry; person-specific sensitive information; privacy model; privacy-preserving data mashup algorithm; service-oriented architecture; social networks; Companies; Couplings; Data models; Data privacy; Mashups; Privacy; Social network services; Privacy protection; anonymity; data integration; data mashup; high dimensionality.; service-oriented architecture;
Journal_Title :
Services Computing, IEEE Transactions on
DOI :
10.1109/TSC.2011.13