Differentially Private Policy Gradient

Rio, Alexandre, Barlier, Merwan, Colin, Igor

Jan-31-2025–arXiv.org Artificial Intelligence

Motivated by the increasing deployment of reinforcement learning in the real world, involving a large consumption of personal data, we introduce a differentially private (DP) policy gradient algorithm. We show that, in this setting, the introduction of Differential Privacy can be reduced to the computation of appropriate trust regions, thus avoiding the sacrifice of theoretical properties of the DP-less methods. Therefore, we show that it is possible to find the right trade-off between privacy noise and trust-region size to obtain a performant differentially private policy gradient algorithm. We then outline its performance empirically on various benchmarks. Our results and the complexity of the tasks addressed represent a significant improvement over existing DP algorithms in online RL.

machine learning, natural language, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

Jan-31-2025

arXiv.org PDF

Add feedback

Country:
- Oceania > Palau (0.04)
- North America
  - United States
    - Nevada (0.04)
    - New York
      - Richmond County > New York City (0.04)
      - Queens County > New York City (0.04)
      - New York County > New York City (0.04)
      - Kings County > New York City (0.04)
      - Bronx County > New York City (0.04)
    - Colorado > Denver County
      - Denver (0.04)
  - Puerto Rico > San Juan
    - San Juan (0.04)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Europe
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Spain > Valencian Community
    - Valencia Province > Valencia (0.04)
  - France
    - Île-de-France > Paris
      - Paris (0.04)
    - Hauts-de-France > Nord
      - Lille (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - South Korea > Daegu
    - Daegu (0.04)
  - China > Beijing
    - Beijing (0.04)

Genre:
- Research Report > New Finding (0.48)

Industry:
- Information Technology > Security & Privacy (1.00)
- Health & Medicine > Therapeutic Area
  - Endocrinology > Diabetes (0.46)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Natural Language (1.00)
    - Machine Learning
      - Reinforcement Learning (1.00)
      - Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found