Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices

Open in new window