Federated Reinforcement Distillation with Proxy Experience Memory

Cha, Han, Park, Jihong, Kim, Hyesung, Kim, Seong-Lyun, Bennis, Mehdi

Jul-15-2019–arXiv.org Machine Learning

In distributed reinforcement learning, it is common to exchange the experience memory of each agent and thereby collectively train their local models. The experience memory, however, contains all the preceding state observations and their corresponding policies of the host agent, which may violate the privacy of the agent. To avoid this problem, in this work, we propose a privacy-preserving distributed reinforcement learning (RL) framework, termed federated reinforcement distillation (FRD). The key idea is to exchange a proxy experience memory comprising a pre-arranged set of states and time-averaged policies, thereby preserving the privacy of actual experiences. Based on an advantage actor-critic RL architecture, we numerically evaluate the effectiveness of FRD and investigate how the performance of FRD is affected by the proxy memory structure and different memory exchanging rules.

agent, artificial intelligence, machine learning, (13 more...)

arXiv.org Machine Learning

Jul-15-2019

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - New York (0.04)
    - Florida > Broward County
      - Fort Lauderdale (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe > Finland
  - Northern Ostrobothnia > Oulu (0.05)
- Asia > South Korea
  - Seoul > Seoul (0.04)

Genre:
- Research Report (0.41)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found