Privacy Preserving Recalibration under Domain Shift

Luo, Rachel, Zhao, Shengjia, Song, Jiaming, Kuck, Jonathan, Ermon, Stefano, Savarese, Silvio

Aug-21-2020–arXiv.org Artificial Intelligence

Classifiers deployed in high-stakes real-world applications must output calibrated confidence scores, i.e. their predicted probabilities should reflect empirical frequencies. Recalibration algorithms can greatly improve a model's probability estimates; however, existing algorithms are not applicable in real-world situations where the test data follows a different distribution from the training data, and privacy preservation is paramount (e.g. protecting patient records). We introduce a framework that abstracts out the properties of recalibration problems under differential privacy constraints. This framework allows us to adapt existing recalibration algorithms to satisfy differential privacy while remaining effective for domain-shift situations. Guided by our framework, we also design a novel recalibration algorithm, accuracy temperature scaling, that outperforms prior work on private datasets. In an extensive empirical study, we find that our algorithm improves calibration on domain-shift benchmarks under the constraints of differential privacy. On the 15 highest severity perturbations of the ImageNet-C dataset, our method achieves a median ECE of 0.029, over 2x better than the next best recalibration method and almost 5x better than without recalibration.

artificial intelligence, data mining, machine learning, (16 more...)

arXiv.org Artificial Intelligence

Aug-21-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California > Santa Clara County > Palo Alto (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (0.64)

Industry:
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine
  - Diagnostic Medicine (0.92)
  - Government Relations & Public Policy (0.67)
  - Health Care Providers & Services > Reimbursement (0.45)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Data Science > Data Mining (1.00)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Machine Learning
      - Neural Networks (1.00)
      - Statistical Learning (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found