CaRL: Learning Scalable Planning Policies with Simple Rewards

Open in new window