Offline Reinforcement Learning via Inverse Optimization

Open in new window