Workshop

Economics of privacy and data labor

Nikolaos Vasiloglou, Rachel Cummings, Glen Weyl, Paris Koutris, Meg Young, Ruoxi Jia, David Dao, Bo Waggoner

Keywords: Economics, Market Design, Auctions, Privacy, Data Labor, Data Pricing, Data Valuation, Data Markets, Data Exchanges

Abstract:

Although data is considered the "new oil," it is very hard to price. Raw data has been invaluable in sectors such as advertising and healthcare, but often at the cost of people's privacy. Labeled data has also been extremely valuable for training machine learning models (for example, in the driverless-car industry), as indicated by the growth of annotation companies such as Figure Eight and Scale AI, especially in the image space. Yet it is not clear what the right pricing is for the data workers who annotate data, or for the individuals who contribute their personal data while using digital services. In the latter case, it is especially unclear how the value of the services offered compares to that of the private data exchanged. While the first data marketplaces have appeared, such as AWS Data Exchange, Narrative.io, and nitrogen.ai, they suffer from a lack of good pricing models. They also fail to preserve the right of data owners to define how their own data will be used. There have been numerous proposals for sharing data while maintaining privacy, such as training generative models that preserve the statistics of the original data.

Schedule

Sat 7:00 a.m. - 7:15 a.m.

We study differentially private mean estimation in a high-dimensional setting. Existing differential privacy techniques applied to large dimensions lead to computationally intractable problems or estimators with excessive privacy loss. Recent work in high-dimensional robust statistics has identified computationally tractable mean estimation algorithms with asymptotic dimension-independent error guarantees. We incorporate these results to develop a strict bound on the global sensitivity of the robust mean estimator. This yields a computationally tractable algorithm for differentially private mean estimation in high dimensions with dimension-independent privacy loss. Finally, we show on synthetic data that our algorithm significantly outperforms classic differential privacy methods, overcoming barriers to high-dimensional differential privacy.
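The baseline this talk improves on can be made concrete: the classic Gaussian mechanism clips each record's L2 norm and adds isotropic noise calibrated to the clipped sensitivity, so the L2 error of the estimate grows with the dimension. A minimal sketch, assuming standard (ε, δ)-DP calibration; the function name and parameters are illustrative, not from the paper:

```python
import numpy as np

def dp_mean_gaussian(X, clip_norm, epsilon, delta):
    """Classic (epsilon, delta)-DP mean estimate: L2 clipping + Gaussian noise.

    Each row of X is one individual's record. Clipping bounds the L2
    sensitivity of the mean under replace-one neighbors by 2*clip_norm/n.
    """
    n, d = X.shape
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    clipped = X * np.minimum(1.0, clip_norm / np.maximum(norms, 1e-12))
    sensitivity = 2.0 * clip_norm / n
    # Standard Gaussian-mechanism noise scale for (epsilon, delta)-DP.
    sigma = sensitivity * np.sqrt(2.0 * np.log(1.25 / delta)) / epsilon
    return clipped.mean(axis=0) + np.random.normal(0.0, sigma, size=d)
```

The per-coordinate noise scale is dimension-free, but the total L2 error of the noise vector grows like sqrt(d), which is exactly the dimension dependence the robust-statistics approach above removes.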

Sat 7:15 a.m. - 7:30 a.m.

Recent advances in generating synthetic data that allow principled privacy protections -- such as differential privacy -- to be added are a crucial step toward sharing statistical information in a privacy-preserving way. But while the focus has been on privacy guarantees, the resulting private synthetic data is only useful if it still carries statistical information from the original data. To further optimise the inherent trade-off between data privacy and data quality, it is necessary to think closely about the latter. What is it that data analysts want? Acknowledging that data quality is a subjective concept, we develop a framework to evaluate the quality of differentially private synthetic data from an applied researcher's perspective. Data quality can be measured along two dimensions. First, the quality of synthetic data can be evaluated against the training data or against an underlying population. Second, the quality of synthetic data depends on general similarity of distributions or on performance for specific tasks such as inference or prediction. It is clear that accommodating all goals at once is a formidable challenge. We invite the academic community to jointly advance the privacy-quality frontier.
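The second dimension above can be illustrated with two toy metrics: a general distribution-similarity check and a task-specific "train on synthetic, test on real" utility gap. This is a minimal sketch with illustrative stand-in metrics, not the paper's actual framework:

```python
import numpy as np

def fidelity_gap(real, synth):
    """General distributional similarity: a crude per-feature
    comparison of means and standard deviations (0 = identical)."""
    return float(np.abs(real.mean(0) - synth.mean(0)).max()
                 + np.abs(real.std(0) - synth.std(0)).max())

def utility_gap(real_X, real_y, synth_X, synth_y):
    """Task-specific quality: fit a least-squares model on synthetic data,
    evaluate on real data, and compare with training on the real data."""
    def fit(X, y):
        return np.linalg.lstsq(np.c_[X, np.ones(len(X))], y, rcond=None)[0]
    def mse(w, X, y):
        return float(((np.c_[X, np.ones(len(X))] @ w - y) ** 2).mean())
    return (mse(fit(synth_X, synth_y), real_X, real_y)
            - mse(fit(real_X, real_y), real_X, real_y))
```

Good synthetic data can score well on one metric and badly on the other, which is why the two dimensions have to be evaluated separately.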

Sat 7:30 a.m. - 7:45 a.m.

Sat 7:45 a.m. - 8:00 a.m.
Break
Sat 8:00 a.m. - 9:00 a.m.
Buying data over time by Nicole Immorlica (Invited Talk)
Sat 9:00 a.m. - 9:15 a.m.

We study the secure stochastic convex optimization problem: a learner aims to find the optimal point of a convex function by sequentially querying a (stochastic) gradient oracle, while an adversary aims to free-ride and infer the learner's outcome by observing the learner's queries. The adversary observes only the points of the queries, not the feedback from the oracle. The goal of the learner is to optimize accuracy, i.e., obtain an accurate estimate of the optimal point, while securing her privacy, i.e., making it difficult for the adversary to infer the optimal point. We formally quantify this tradeoff between the learner's accuracy and privacy and characterize lower and upper bounds on the learner's query complexity as a function of the desired levels of accuracy and privacy. For the lower bounds, we provide a general template based on information-theoretic analysis and then tailor it to several families of problems, including stochastic convex optimization and (noisy) binary search. We also present a generic secure learning protocol that achieves the matching upper bound up to logarithmic factors.
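The basic currency of this tradeoff -- extra queries buy ambiguity about the learner's trajectory -- can be illustrated with a toy scheme. This is NOT the talk's protocol (a real protocol must also make decoy and real queries statistically indistinguishable); it is only a sketch of why the adversary's view, which contains query points but no oracle feedback, becomes less informative as the learner spends more queries:

```python
import random

def secure_gd(grad_oracle, x0, lr=0.1, steps=100, n_decoys=4, seed=0):
    """Toy sketch: interleave real gradient-descent queries with decoy
    trajectories taking random-walk steps of comparable size. The adversary
    sees every queried point but not the oracle feedback, so each of the
    n_decoys + 1 interleaved trajectories could a priori be the real one."""
    rng = random.Random(seed)
    real_idx = rng.randrange(n_decoys + 1)
    points = [x0 + rng.uniform(-5.0, 5.0) for _ in range(n_decoys + 1)]
    points[real_idx] = x0
    queries = []                      # the adversary's entire view
    for _ in range(steps):
        for i in range(n_decoys + 1):
            queries.append(points[i])
            if i == real_idx:
                points[i] -= lr * grad_oracle(points[i])   # true descent step
            else:
                points[i] -= lr * rng.gauss(0.0, 1.0)      # decoy step
    return points[real_idx], queries
```

Accuracy is unchanged (the real trajectory is ordinary gradient descent), but the query count is multiplied by n_decoys + 1 -- a crude version of the accuracy-privacy-query-complexity tradeoff the talk characterizes tightly.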

Sat 9:15 a.m. - 9:30 a.m.

Recommender systems are an essential part of any e-commerce platform. Recommendations are typically generated by aggregating large amounts of user data. A malicious actor may be motivated to sway the output of such recommender systems by injecting malicious datapoints to leverage the system for financial gain. In this work, we propose a semi-supervised attack detection algorithm to identify the malicious datapoints. We do this by leveraging a portion of the dataset that has a lower chance of being polluted to learn the distribution of genuine datapoints. Our proposed approach modifies the Generative Adversarial Network architecture to take into account the contextual information from user activity. This allows the model to distinguish legitimate datapoints from the injected ones.
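The detection pipeline can be sketched generically: fit a model of genuine behavior on the trusted (low-pollution) subset, then flag datapoints that score as unlikely under it. The paper's model is a modified, context-aware GAN; the diagonal Gaussian below is only a simple stand-in for that learned density, and all names are illustrative:

```python
import numpy as np

def fit_genuine_model(trusted):
    """Learn the distribution of genuine datapoints from the low-pollution
    subset. (Stand-in for the paper's context-aware GAN: a diagonal Gaussian.)"""
    return trusted.mean(axis=0), trusted.std(axis=0) + 1e-9

def anomaly_scores(model, X):
    """Score each datapoint; a high score means unlike the genuine data."""
    mu, sd = model
    return (((X - mu) / sd) ** 2).sum(axis=1)

def flag_injected(trusted, X, quantile=0.99):
    """Flag datapoints scoring above the trusted subset's score quantile."""
    model = fit_genuine_model(trusted)
    threshold = np.quantile(anomaly_scores(model, trusted), quantile)
    return anomaly_scores(model, X) > threshold
```

The semi-supervised assumption does the work here: the threshold is calibrated only on data with a low chance of being polluted, so injected datapoints never influence what "genuine" looks like.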

Sat 9:30 a.m. - 9:45 a.m.

While many solutions for privacy-preserving convex empirical risk minimization (ERM) have been developed, privacy-preserving nonconvex ERM remains a challenge. We study nonconvex ERM, which takes the form of minimizing a finite-sum of nonconvex loss functions over a training set. We propose a new differentially private stochastic gradient descent algorithm for nonconvex ERM that achieves strong privacy guarantees efficiently, and provide a tight analysis of its privacy and utility guarantees, as well as its gradient complexity. Our algorithm substantially reduces gradient complexity while matching the best previous utility guarantee given by Wang et al.\ (NeurIPS 2017). Our experiments on benchmark nonconvex ERM problems demonstrate superior performance in terms of both training cost and utility gains compared with previous differentially private methods using the same privacy budgets.
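The common skeleton of differentially private SGD -- per-example gradient clipping followed by Gaussian noise on the batch sum -- can be sketched as follows. This is the generic Abadi-et-al.-style recipe, not the specific algorithm of the talk, and the hyperparameters are illustrative; the privacy budget actually spent depends on the noise multiplier, sampling rate, and step count via a moments-accountant analysis:

```python
import numpy as np

def dp_sgd(per_example_grad, w0, n, clip=1.0, noise_mult=1.1,
           lr=0.1, steps=200, batch=32, seed=0):
    """DP-SGD sketch: clip each per-example gradient to L2 norm `clip`,
    then add Gaussian noise of scale noise_mult * clip to the batch sum."""
    rng = np.random.default_rng(seed)
    w = np.array(w0, dtype=float)
    for _ in range(steps):
        idx = rng.choice(n, size=batch, replace=False)
        g = per_example_grad(w, idx)                       # shape (batch, d)
        norms = np.linalg.norm(g, axis=1, keepdims=True)
        g = g * np.minimum(1.0, clip / np.maximum(norms, 1e-12))
        noisy_sum = g.sum(axis=0) + rng.normal(0.0, noise_mult * clip,
                                               size=w.shape)
        w -= lr * noisy_sum / batch
    return w
```

Clipping bounds each individual's influence on the update, which is what makes the Gaussian noise sufficient for a differential privacy guarantee even when the loss is nonconvex.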

Sat 9:45 a.m. - 10:30 a.m.
Break
Sat 10:30 a.m. - 10:45 a.m.

We demonstrate how privacy law interacts with competition and trade policy in the context of the European General Data Protection Regulation (GDPR). We follow more than 110,000 websites for 18 months to show that websites reduced their connections to web technology providers after GDPR became effective, especially regarding requests involving personal data. This also holds for websites catering to non-EU audiences and therefore not bound by GDPR. We further document an increase in market concentration in web technology services after the introduction of GDPR. While most firms lose market share, the leading firm, Google, significantly increases market share.

Sat 10:45 a.m. - 11:00 a.m.

Prediction APIs offered for a fee are a fast-growing industry and an important part of machine learning as a service. While many such services are available, the heterogeneity in their price and performance makes it challenging for users to decide which API or combination of APIs to use for their own data and budget. In this paper, we take a first step towards addressing this challenge by proposing FrugalML, a principled framework that jointly learns the strengths and weaknesses of each API on different data, and performs an efficient optimization to automatically identify the best sequential strategy to adaptively use the available APIs within a budget constraint. Preliminary experiments using ML APIs from Google, Microsoft and Face++ for a facial emotion recognition task show that FrugalML typically leads to more than 50% cost reduction while matching the accuracy of the best single API.
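The core of a sequential strategy like this is simple to sketch: call a cheap API first and pay for a stronger one only when the cheap prediction is unconvincing. In the paper, the fallback threshold is label-dependent and chosen by optimizing accuracy under the budget constraint; the version below is a heavily simplified sketch with stand-in API names:

```python
def frugal_predict(x, cheap_api, strong_api, threshold=0.8):
    """Sequential API strategy (simplified FrugalML-style): each API returns
    (label, confidence, cost); fall back to the stronger, costlier API only
    when the cheap prediction's confidence is below the threshold."""
    label, confidence, cost = cheap_api(x)
    if confidence >= threshold:
        return label, cost
    strong_label, _, strong_cost = strong_api(x)
    return strong_label, cost + strong_cost
```

When most inputs are easy, most queries stop at the cheap API, which is where the reported cost reductions come from.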

Sat 11:00 a.m. - 11:15 a.m.

Collection and sale of personal data is a common and economically rewarding activity. However, the contractual model of notice and consent that governs this activity under U.S. law relies on an assumption that personal data can and does function as a market good. This paper presents experimental evidence of a conflict between the market nature of personal data assumed by many legal frameworks and the conceptual categorization of personal data transactions by the ordinary people putatively protected by notice and consent legal frameworks. I present two online vignette studies that repurpose designs from the taboo trade-offs literature and suggest that protection of personal data rises to the level of a sacred value.

Sat 11:15 a.m. - 11:30 a.m.
Break
Sat 11:30 a.m. - 12:30 p.m.

Data are interpersonally relational rather than atomistically personal or universally objective. Yet the group of people to which a datum pertains differs across all the data pertaining to any one person, so every person sits at the intersection of a diversity of data collectives. A data structure that represents this, as well as interpersonal relationships of trust, has the potential to add a trust layer to internet-type structures, to allow verification, at higher levels of trust, of a far wider range of data than current data structures permit, and eventually to enable political economies far more sophisticated than even those currently considered innovative (such as those advocated by organizations like RadicalxChange).
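One possible way to make the idea concrete is a structure in which each datum carries the group of people it pertains to, alongside a graph of pairwise trust. This is only an illustrative sketch of the abstract's premise -- the class and rule below (verification bounded by the weakest trust link) are hypothetical, not from the talk:

```python
from dataclasses import dataclass, field

@dataclass(frozen=True)
class Datum:
    """A datum pertains to a group of people, not to a single atomic owner."""
    content: str
    pertains_to: frozenset

@dataclass
class TrustGraph:
    """Directed pairwise trust levels in [0, 1] between people."""
    trust: dict = field(default_factory=dict)

    def add_trust(self, a, b, level):
        self.trust[(a, b)] = level

    def verification_level(self, verifier, datum):
        # Hypothetical rule: a datum is verifiable only as strongly as the
        # verifier's weakest trust link to the people it pertains to.
        return min((self.trust.get((verifier, p), 0.0)
                    for p in datum.pertains_to), default=0.0)
```

Because each datum names its own collective, every person ends up appearing in many overlapping `pertains_to` sets -- the "intersection of data collectives" described above.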