Demonstration-efficient Inverse Reinforcement Learning in Procedurally Generated Environments
