Adapting Image-based RL Policies via Predicted Rewards

Open in new window