Reward prediction for representation learning and reward shaping

Open in new window