Title :
Privacy and the Price of Data
Author_Institution :
Dept. of Comput. Sci. & Eng., Penn State Univ., University Park, PA, USA
Abstract :
Several important application areas in data science involve assigning numbers to (possibly randomized) algorithms. In the case of statistical privacy, it is important to quantify the amount of information leaked by a data processing algorithm. In the case of data marketplaces, it is important to properly set the prices for data queries (which, in general, can be specified by arbitrary algorithms). In both cases, it is also important to quantify application-specific utility of the outputs of an algorithm or query. For example, if a user has a choice of purchasing some combination of query answers, the user´s decision should be guided by the consideration of a utility/price trade-off. These numbers cannot be assigned to algorithms in an arbitrary manner - there are common-sense restrictions that must be enforced. For example, the answer to an expensive query should not be derivable from a much cheaper query. Each application has its own set of restrictions but when they are formulated mathematically, common patterns emerge. Thus technical results from one area can often be ported to the others.
Keywords :
data privacy; database management systems; query processing; application-specific utility; data marketplaces; data price; data privacy; data processing algorithm; data queries; data science; data sets; randomized algorithms; statistical privacy; utility/price trade-off; Accidents; Computer science; Data privacy; Data processing; Databases; Pricing; Privacy; data marketplace; query pricing; statistical privacy; utility;
Conference_Titel :
Logic in Computer Science (LICS), 2015 30th Annual ACM/IEEE Symposium on
Conference_Location :
Kyoto
DOI :
10.1109/LICS.2015.10