OptStream: Releasing Time Series Privately
Fioretto, Ferdinando, Van Hentenryck, Pascal
–arXiv.org Artificial Intelligence
Many applications of machine learning and optimization operate on data streams. While these datasets are fundamental to fuel decision-making algorithms, often they contain sensitive information about individuals and their usage poses significant privacy risks. Motivated by an application in energy systems, this paper presents OPTSTREAM, a novel algorithm for releasing differentially private data streams under the w-event model of privacy. OPTSTREAM is a 4-step procedure consisting of sampling, perturbation, reconstruction, and post-processing modules. First, the sampling module selects a small set of points to access in each period of interest. Then, the perturbation module adds noise to the sampled data points to guarantee privacy. Next, the reconstruction module reassembles non-sampled data points from the perturbed sample points. Finally, the post-processing module uses convex optimization over the private output of the previous modules, as well as the private answers of additional queries on the data stream, to improve accuracy by redistributing the added noise. OPTSTREAM is evaluated on a test case involving the release of a real data stream from the largest European transmission operator. Experimental results show that OPTSTREAM may not only improve the accuracy of state-of-the-art methods by at least one order of magnitude but also supports accurate load forecasting on the private data.
arXiv.org Artificial Intelligence
Oct-30-2018
- Country:
- North America
- Europe > France
- Auvergne-Rhône-Alpes (0.05)
- Île-de-France (0.04)
- Provence-Alpes-Côte d'Azur (0.04)
- Pays de la Loire (0.04)
- Nouvelle-Aquitaine (0.04)
- Hauts-de-France (0.04)
- Centre-Val de Loire (0.04)
- Bourgogne-Franche-Comté (0.04)
- Genre:
- Research Report > New Finding (0.34)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Energy > Power Industry (1.00)
- Technology:
- Information Technology
- Security & Privacy (1.00)
- Data Science (1.00)
- Communications (1.00)
- Artificial Intelligence
- Machine Learning (1.00)
- Representation & Reasoning > Optimization (0.46)
- Information Technology