PerSim: Data-efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators

Neural Information Processing Systems 

We perform extensive experiments across several benchmark environments and RL methods.