A Ablations

Aug-15-2025, 23:29:18 GMT–Neural Information Processing Systems

We find that past play greatly stabilizes the emergence of reciprocity in IPD. In cells containing another agent, we include the RUSP observations in these channels. In Figure 11 we show results when training with RUSP in these environments. Consistent with past work, the greedy baseline fails to reach a solution with high collective return. We use a distributed computing infrastructure used in Berner et al.

action head, agent, prisoner, (16 more...)

Neural Information Processing Systems

Aug-15-2025, 23:29:18 GMT

Conferences PDF

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (0.49)
  - Machine Learning
    - Neural Networks (0.31)
    - Reinforcement Learning (0.31)

Duplicate Docs Excel Report

Title
b63c87b0a41016ad29313f0d7393cee8-Supplemental.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found