When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits

Jan-18-2025, 22:48:44 GMT–Neural Information Processing Systems

We study the problem of multi-armed bandits with ε-global Differential Privacy (DP). First, we prove the minimax and problem-dependent regret lower bounds for stochastic and linear bandits that quantify the hardness of bandits with ε-global DP. These bounds suggest the existence of two hardness regimes depending on the privacy budget ε. In the high-privacy regime (small ε), the hardness depends on a coupled effect of privacy and partial information about the reward distributions. In the low-privacy regime (large ε), bandits with ε-global DP are not harder than the bandits without privacy.

bandit, differentially private bandit, privacy meet partial information, (6 more...)

Neural Information Processing Systems

Jan-18-2025, 22:48:44 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology
  - Artificial Intelligence (0.43)
  - Data Science > Data Mining
    - Big Data (0.43)