Goto

Collaborating Authors

 Reinforcement Learning


2051bd70fc110a2208bdbd4a743e7f79-Paper.pdf

Neural Information Processing Systems

In recent years, we have witnessed tremendous progress in deep reinforcement learning(RL)fortaskssuchasGo,Chess,videogames,androbotcontrol.






LearningDynamicBeliefGraphstoGeneralize onText-BasedGames

Neural Information Processing Systems

GATAis trained using acombination of reinforcement and self-supervised learning. Our workdemonstrates thatthelearned graph-based representations helpagents converge to better policies than their text-only counterparts and facilitate effective generalization across game configurations.


FederatedEnsemble-Directed OfflineReinforcementLearning

Neural Information Processing Systems

We consider the problem of federated offline reinforcement learning (RL), a scenario under which distributed learning agents must collaboratively learn a high-quality control policyonly using small pre-collected datasets generated according to different unknown behavior policies. Naรฏvely combining a standard offline RL approach with a standard federated learning approach to solve this problem can lead to poorly performing policies. In response, we develop the Federated Ensemble-Directed Offline Reinforcement Learning Algorithm (FEDORA), which distills the collective wisdom of the clients using an ensemble learning approach. We develop the FEDORA codebase to utilize distributed compute resources on a federated learning platform. We show that FEDORA significantly outperforms other approaches, including offline RL over the combined data pool, in various complex continuous control environments and realworld datasets.