Supplementary Materials of The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games
Neural Information Processing Systems
For notational convenience, we assume here that all agents share the critic and actor networks. In continuous action spaces, an action is sampled from a Gaussian distribution. In the loss functions above, B refers to the batch size and n refers to the number of agents. The Multi-agent Particle-World Environment (MPE) was introduced by Lowe et al. (2017). The StarCraft II Micromanagement Challenge (SMAC) tasks were introduced by Rashid et al. (2019).
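As a rough illustration of the two technical points above, the following sketch samples a continuous action from a diagonal Gaussian and averages per-sample, per-agent loss terms over a batch of size B and n agents. The function names, the diagonal-covariance form, and the choice to average over both B and n are assumptions made for this example; they are not taken from the paper's code.

```python
import math
import random


def sample_gaussian_action(mean, std, rng):
    """Sample one continuous action from a diagonal Gaussian.

    `mean` and `std` are per-dimension lists. In practice an actor network
    would produce them; that network is not modeled here (assumption).
    """
    return [rng.gauss(m, s) for m, s in zip(mean, std)]


def gaussian_log_prob(action, mean, std):
    """Log-density of `action` under the same diagonal Gaussian."""
    return sum(
        -0.5 * ((a - m) / s) ** 2 - math.log(s) - 0.5 * math.log(2 * math.pi)
        for a, m, s in zip(action, mean, std)
    )


def mean_over_batch_and_agents(per_sample_terms):
    """Average a B x n nested list of loss terms (assumed 1/(B*n) scaling)."""
    B = len(per_sample_terms)      # batch size
    n = len(per_sample_terms[0])   # number of agents
    return sum(sum(row) for row in per_sample_terms) / (B * n)


rng = random.Random(0)  # seeded only so the sketch is reproducible
action = sample_gaussian_action([0.0, 1.0], [1.0, 0.5], rng)
log_prob = gaussian_log_prob(action, [0.0, 1.0], [1.0, 0.5])
```

Because all agents are assumed to share one actor, the same sampling routine would serve every agent, with each agent supplying its own mean and standard deviation.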
Aug-17-2025, 06:01:25 GMT