RbRL2.0: Integrated Reward and Policy Learning for Rating-based Reinforcement Learning

Open in new window