AITopics | Agents

Collaborating Authors

Agents

News Overviews Instructional Materials AI-Alerts Classics

Emergent Graphical Conventions in a Visual Communication Game

Neural Information Processing SystemsAug-14-2025, 22:17:16 GMT

Due to its iconic nature ( i.e ., perceptual resemblance to or natural association with the referent), drawings serve as a powerful tool to communicate concepts transcending language barriers (Fay et al., 2014). In fact, we humans started to use drawings to convey messages dating back to 40,000-60,000 years ago (Hoffmann et al., 2018; Hawkins et al., 2019).

communication, convention, sketch, (14 more...)

Neural Information Processing Systems

Country: Asia > China > Beijing > Beijing (0.04)

Genre:

Research Report (0.68)
Workflow (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

FACMAC: Factored Multi-Agent Centralised Policy Gradients Bei Peng University of Liverpool T abish Rashid University of Oxford Christian A. Schroeder de Witt

Neural Information Processing SystemsAug-14-2025, 21:39:32 GMT

However, unlike QMIX, there are no inherent constraints on factoring the critic. We thus also employ a nonmonotonic factorisation and empirically demonstrate that its increased representational capacity allows it to solve some tasks that cannot be solved with monolithic, or monotonically factored critics.

agent, facmac, policy gradient, (12 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.41)
Europe > Switzerland (0.04)
Europe > Netherlands > South Holland > Delft (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Delayed Propagation Transformer: A Universal Computation Engine towards Practical Control in Cyber-Physical Systems

Neural Information Processing SystemsAug-14-2025, 21:32:09 GMT

Multi-agent control is a central theme in the Cyber-Physical Systems (CPS) . However, current control methods either receive non-Markovian states due to insufficient sensing and decentralized design, or suffer from poor convergence.

arxiv preprint arxiv, dept, transformer, (12 more...)

Neural Information Processing Systems

Country: North America > United States > Texas > Travis County > Austin (0.04)

Industry:

Transportation > Ground > Road (0.95)
Transportation > Infrastructure & Services (0.70)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

Appendix A Pseudocode of DRE-MARL

Neural Information Processing SystemsAug-14-2025, 20:36:07 GMT

The pseudocode for DRE-MARL training is shown in Algorithm 20, which takes the following steps. The property of the received reward in this environment is set to be collaborative. It is a scenario with two agents and three landmarks. Navigation and Reference is that the target landmark of each agent is only known to its partner. We use the abbreviation REF to denote this environment.

dre-marl, reward aggregation, reward uncertainty, (14 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.89)

Add feedback

520425a5a4c2fb7f7fc345078b188201-Paper-Conference.pdf

Neural Information Processing SystemsAug-14-2025, 20:36:05 GMT

estimation, reward estimation, reward uncertainty, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Maryland > Prince George's County > College Park (0.14)
North America > United States > Pennsylvania > Northampton County > Bethlehem (0.04)
Asia > Middle East > Jordan (0.04)
Asia > China > Jilin Province > Changchun (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

A code

Neural Information Processing SystemsAug-14-2025, 20:18:32 GMT

This section is meant to give an overview of our opensource code. Together with this git repo, we include a'tutorial colab' - a Jupyter notebooks that can be run in the browser without requiring any local installation at We view this open-source effort as a major contribution of our paper. We present the testbed pseudocode in this section. Recall from Section 3.1 that we We now describe the other parameters we use in the Testbed. In this section, we describe the benchmark agents in Section 3.3 and the choice of various Step 3: compute likelihoods for n = 1, 2, . . .

agent, implementation and hyperparameter sweep, input dimension, (10 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.70)

Add feedback

60106888f8977b71e1f15db7bc9a88d1-Paper.pdf

Neural Information Processing SystemsAug-14-2025, 19:03:24 GMT

international conference, learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > Canada > Quebec > Montreal (0.04)
(2 more...)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.72)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback

A Appendix

Neural Information Processing SystemsAug-14-2025, 18:56:20 GMT

Algorithm 1 shows the execution rules of parallel programs. Terminate the program if no subsequent subroutine exists. Compute the cost of each possible allocation based on the auxiliary functions. The common hyperparameters are listed below. Name V alue learning rate 3e-4 training steps 10M update batch size 256 number of rollout threads 8 rollout buffer size 4096 8 weight of value loss 0.1 weight of policy loss 1 weight of entropy loss 0.01 In cooperative settings, the goal input of the assistive agent is the leading agent's goal.

agent, subroutine, subtask, (15 more...)

Neural Information Processing Systems

Technology: