A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback

Open in new window