AITopics | Agents

Agents

Learning Affordance Landscapes for Interaction Exploration in 3D Environments

Neural Information Processing Systems, Oct-2-2025, 05:10:36 GMT

Embodied agents operating in human spaces must be able to master how their environment works: what objects can the agent use, and how can it use them? We introduce a reinforcement learning approach for exploration for interaction, whereby an embodied agent autonomously discovers the affordance landscape of a new unmapped 3D environment (such as an unfamiliar kitchen).

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.69)

Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control

Sai Qian Zhang, Qi Zhang, Jieyu Lin

Neural Information Processing Systems, Oct-2-2025, 04:32:21 GMT

Multi-agent reinforcement learning (MARL) has recently received considerable attention due to its applicability to a wide range of real-world applications.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Country: North America > Canada (0.28)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)

14cfdb59b5bda1fc245aadae15b1984a-AuthorFeedback.pdf

Neural Information Processing Systems, Oct-2-2025, 04:32:07 GMT

We thank the reviewers for their insightful comments. We will incorporate the feedback and suggestions into the next revision of the paper. A: The messages exchanged between the agents generally convey agent status information (location, health status, etc.). Over time, the communication level gradually decreases as agents move to the right position (steps 250, 430). We can also design similar experiments to infer the meaning of other types of messages. A: VBC is most beneficial to multi-agent systems that require quick decision making and low communication overhead.

artificial intelligence, reviewer, variance, (15 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games

Neural Information Processing Systems, Oct-2-2025, 04:12:57 GMT

The stochastic game (SG) is a classical multi-agent model that has received extensive attention in recent MARL studies.

artificial intelligence, gradient play, machine learning, (16 more...)

Country:

North America > Canada > Alberta (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.85)

Discovery of Useful Questions as Auxiliary Tasks

Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Janarthanan Rajendran, Richard L. Lewis, Junhyuk Oh, Hado P. van Hasselt, David Silver, Satinder Singh

Neural Information Processing Systems, Oct-2-2025, 03:10:28 GMT

Arguably, intelligent agents ought to be able to discover their own questions so that in learning answers for them they learn unanticipated useful knowledge and skills; this departs from the focus in much of machine learning on agents learning answers to externally defined questions. We present a novel method for a reinforcement learning (RL) agent to discover questions formulated as general value functions or GVFs, a fairly rich form of knowledge representation. Specifically, our method uses non-myopic meta-gradients to learn GVF-questions such that learning answers to them, as an auxiliary task, induces useful representations for the main task faced by the RL agent. We demonstrate that auxiliary tasks based on the discovered GVFs are sufficient, on their own, to build representations that support main task learning, and that they do so better than popular hand-designed auxiliary tasks from the literature. Furthermore, we show, in the context of Atari 2600 videogames, how such auxiliary tasks, meta-learned alongside the main task, can improve the data efficiency of an actor-critic agent.

auxiliary task, machine learning, reinforcement learning, (15 more...)

Country:

North America > Canada (0.46)
North America > United States (0.28)

Genre: Research Report (0.88)

Industry: Leisure & Entertainment > Games > Computer Games (0.49)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Learning Fairness in Multi-Agent Systems

Neural Information Processing Systems, Oct-2-2025, 02:51:53 GMT

Fairness is essential for human society, contributing to stability and productivity. Similarly, fairness is also the key for many multi-agent systems.

agent, artificial intelligence, fairness, (14 more...)

Industry: Leisure & Entertainment (0.70)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Market Scoring Rules Act As Opinion Pools For Risk-Averse Agents

Mithun Chakraborty, Sanmay Das

Neural Information Processing Systems, Oct-2-2025, 02:31:32 GMT

A market scoring rule (MSR) - a popular tool for designing algorithmic prediction markets - is an incentive-compatible mechanism for the aggregation of probabilistic beliefs from myopic risk-neutral agents. In this paper, we add to a growing body of research aimed at understanding the precise manner in which the price process induced by a MSR incorporates private information from agents who deviate from the assumption of risk-neutrality. We first establish that, for a myopic trading agent with a risk-averse utility function, a MSR satisfying mild regularity conditions elicits the agent's risk-neutral probability conditional on the latest market state rather than her true subjective probability. Hence, we show that a MSR under these conditions effectively behaves like a more traditional method of belief aggregation, namely an opinion pool, for agents' true probabilities.

agent, opinion pool, probability, (15 more...)

Country:

North America > United States > Missouri > St. Louis County > St. Louis (0.04)
North America > United States > Michigan (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Explainable Voting

Neural Information Processing Systems, Oct-2-2025, 02:22:20 GMT

The design of voting rules is traditionally guided by desirable axioms.

artificial intelligence, axiom, explanation, (15 more...)

Genre: Research Report (0.47)

Industry: Government > Voting & Elections (0.93)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Unsupervised Emergence of Egocentric Spatial Structure from Sensorimotor Prediction

Alban Laflaquière, Michael Garcia Ortiz

Neural Information Processing Systems, Oct-2-2025, 01:41:04 GMT

artificial intelligence, machine learning, spatial reasoning, (15 more...)

Country: Europe > France (0.14)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.69)

No-Regret Learning and Mixed Nash Equilibria: They Do Not Mix

Neural Information Processing Systems, Oct-2-2025, 01:32:52 GMT

As such, several crucial questions arise: What are the game-theoretic implications of the no-regret guarantees of FTRL? Do the dynamics of FTRL converge to an equilibrium of the underlying game? A folk answer to this question is that "no-regret learning converges to equilibrium in all games"

artificial intelligence, asymptotically stable, machine learning, (19 more...)

Country: