Generalizing from a few environments in safety-critical reinforcement learning
