Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms
Alexander Bukharin, Yan Li, Yue Yu

Neural Information Processing Systems 

However, ERNIE's adversarial regularization may introduce some training instability.