On-Policy Robot Imitation Learning from a Converging Supervisor

Open in new window