Iteratively Learn Diverse Strategies with State Distance Information

Neural Information Processing Systems 

In addition, we examine two common computation frameworks for this problem, i.e., population-based training (PBT) and iterative learning