Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning

Open in new window