Supplementary Materials of The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games

Feb-11-2026, 00:26:33 GMT–Neural Information Processing Systems

We consider the 3 fully cooperative tasks from the original set shown in Figure 1(a): Spread, Comm, and Reference. "Use feature normalization" refers to whether feature normalization is applied to the network input. In this appendix section, we include results which demonstrate the benefit of parameter sharing. Note that our global state to the value network has agent-specific information, such as available actions and relative distances to other agents. When an agent dies, these agent-specific features become zero, while the remaining agent-agnostic features remain nonzero; this leads to a drastic distribution shift in the critic input compared to states in which the agent is alive.
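The distribution shift described above can be illustrated with a minimal sketch. It assumes the critic input is a simple concatenation of agent-agnostic and agent-specific features; the function name `build_critic_input`, the feature values, and the vector sizes are hypothetical and are not taken from the paper's code.

```python
import numpy as np

def build_critic_input(global_feats, agent_feats, alive):
    """Concatenate agent-agnostic and agent-specific features for one agent.

    Assumption for illustration: when the agent is dead, its agent-specific
    features (e.g. available actions, relative distances) are all zero,
    while the agent-agnostic features are left unchanged.
    """
    global_feats = np.asarray(global_feats, dtype=np.float32)
    agent_feats = np.asarray(agent_feats, dtype=np.float32)
    if not alive:
        agent_feats = np.zeros_like(agent_feats)
    return np.concatenate([global_feats, agent_feats])

# Hypothetical feature values, chosen only to show the effect.
global_feats = [0.5, -1.2, 0.3]       # agent-agnostic part
agent_feats = [1.0, 0.0, 1.0, 0.7]    # agent-specific part

alive_input = build_critic_input(global_feats, agent_feats, alive=True)
dead_input = build_critic_input(global_feats, agent_feats, alive=False)

print("alive:", alive_input)
print("dead: ", dead_input)
```

In the dead case the first three entries match the alive case while the last four are zero, a mixed pattern that a critic trained mostly on alive-agent states rarely sees.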



