AITopics | Agents

A Ablations

Neural Information Processing SystemsAug-15-2025, 23:29:18 GMT

We find that past play greatly stabilizes the emergence of reciprocity in IPD. In cells containing another agent, we include the RUSP observations in these channels. In Figure 11 we show results when training with RUSP in these environments. Consistent with past work, the greedy baseline fails to reach a solution with high collective return. We use a distributed computing infrastructure used in Berner et al.

action head, agent, prisoner, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.31)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.31)

Add feedback

Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences

Neural Information Processing SystemsAug-15-2025, 23:29:11 GMT

Multi-agent reinforcement learning (MARL) has shown recent success in increasingly complex fixed-team zero-sum environments.

agent, prisoner, social dilemma, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.47)

Add feedback

b628386c9b92481fab68fbf284bd6a64-Supplemental.pdf

Neural Information Processing SystemsAug-15-2025, 23:28:50 GMT

agent, algorithm, coachreg, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.50)

Add feedback

b628386c9b92481fab68fbf284bd6a64-Paper.pdf

Neural Information Processing SystemsAug-15-2025, 23:28:43 GMT

agent, coordination, reinforcement learning, (12 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > Middle East > Jordan (0.04)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

b628386c9b92481fab68fbf284bd6a64-AuthorFeedback.pdf

Neural Information Processing SystemsAug-15-2025, 23:28:31 GMT

agent, baseline, maddpg, (12 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.73)

Add feedback

Sample-Efficient Reinforcement Learning of Partially Observable Markov Games

Neural Information Processing SystemsAug-15-2025, 22:43:23 GMT

POMGs--in which sample-efficient learning is tractable.

algorithm, information, pomg, (12 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.94)

Add feedback

8d9a6e908ed2b731fb96151d9bb94d49-Supplemental.pdf

Neural Information Processing SystemsAug-15-2025, 21:08:13 GMT

agent, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.31)

Add feedback

An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning

Neural Information Processing SystemsAug-15-2025, 21:08:10 GMT

Similarly, Multiagent RL (MARL) can also be accelerated if agents can share knowledge with each other.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

Asia > China > Tianjin Province > Tianjin (0.04)
Asia > Macao (0.04)
Asia > China > Liaoning Province > Dalian (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.93)

Add feedback

8caa38721906c1a0bb95c80fab33a893-Supplemental.pdf

Neural Information Processing SystemsAug-15-2025, 20:21:35 GMT

V100 GPUs to train the models. Consortium and are licensed under a Creative Commons Attribution 4.0 License. Similarly, for evaluating the agent listener with a human speaker, each agent evaluates 400 human utterances in Fig 5b. In Fig 10, we present the results of the human evaluation on the text game. Sec 4.3, we show that agents trained using our method beat all prior baselines when paired with both The blue bars show the standard deviation across all agents present in the buffer.

artificial intelligence, machine learning, utterance, (19 more...)

Neural Information Processing Systems

Industry: Information Technology (0.49)

Technology: