Optimistic Multi-Agent Policy Gradient for Cooperative Tasks

Open in new window