Policy Gradient with Self-Attention for Model-Free Distributed Nonlinear Multi-Agent Games

Open in new window