Federated Ensemble-Directed Offline Reinforcement Learning
–Neural Information Processing Systems
We consider the problem of federated offline reinforcement learning (RL), a scenario under which distributed learning agents must collaboratively learn a high-quality control policy only using small pre-collected datasets generated according to different unknown behavior policies. Naïvely combining a standard offline RL approach with a standard federated learning approach to solve this problem can lead to poorly performing policies. In response, we develop the Federated Ensemble-Directed Offline Reinforcement Learning Algorithm (FEDORA), which distills the collective wisdom of the clients using an ensemble learning approach. We develop the FEDORA codebase to utilize distributed compute resources on a federated learning platform. We show that FEDORA significantly outperforms other approaches, including offline RL over the combined data pool, in various complex continuous control environments and real-world datasets.
Neural Information Processing Systems
Nov-20-2025, 08:37:05 GMT
- Country:
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Energy (0.93)
- Information Technology > Security & Privacy (0.67)
- Technology: