Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity

Neural Information Processing Systems

The wider application of end-to-end learning methods to embodied decision-making domains remains bottlenecked by their reliance on a superabundance of training data representative of the target domain.