AITopics | Agents

Agents

7e6361a5d73a8fab093dd8453e0b106f-Paper-Conference.pdf

Neural Information Processing Systems, Feb-10-2026, 04:25:55 GMT

Modeling multi-agent systems requires understanding how agents interact. Such systems are often difficult to model because they can involve a variety of types of interactions that layer together to drive rich social behavioral dynamics.

artificial intelligence, graph, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States > California > Santa Clara County > Palo Alto (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

95f2b84de5660ddf45c8a34933a2e66f-Paper.pdf

Neural Information Processing Systems, Feb-10-2026, 02:19:59 GMT

agent, diplomacy, equilibrium, (16 more...)

Neural Information Processing Systems

Country:

Europe > France (0.05)
Europe > Austria (0.05)

Genre: Research Report > New Finding (0.47)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

ba4849411c8bbdd386150e5e32204198-AuthorFeedback.pdf

Neural Information Processing Systems, Feb-10-2026, 01:57:46 GMT

To test the efficiency of each component, we remove them separately (LG-ODE-no att, LG-ODE-no PE) and find that performance drops. This suggests that distinguishing the importance of nodes w.r.t. time and incorporating temporal information via learnable positional encoding would benefit model performance. For Eqn 2, we adopt the GNN model in [2] to capture the interaction among agents.

artificial intelligence, as shown in table 1, graph structure, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.35)

A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning

Neural Information Processing Systems, Feb-10-2026, 01:16:03 GMT

In contrast, the mathematics needed to analyze such schemes is what forms the focus in Stochastic Approximation (SA) theory [2, 4]. More generally, SA refers to an iterative scheme that helps find zeroes or optimal points of a function, for which only noisy evaluations are possible.
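To make the idea of stochastic approximation concrete, here is a minimal sketch of the classical Robbins-Monro iteration, which looks for a zero of a function using only noisy evaluations. The target function f(x) = x - 2, the Gaussian noise, and the 1/n step size are illustrative assumptions and are not taken from the paper.

```python
import random


def robbins_monro(noisy_f, x0, steps=10_000):
    """Robbins-Monro iteration: x <- x - a_n * noisy_f(x) with a_n = 1/n."""
    x = x0
    for n in range(1, steps + 1):
        step_size = 1.0 / n  # diminishing steps: sum diverges, sum of squares converges
        x -= step_size * noisy_f(x)
    return x


# Hypothetical example: f(x) = x - 2 has its zero at x = 2,
# but each evaluation is corrupted by unit-variance Gaussian noise.
rng = random.Random(0)


def noisy_f(x):
    return (x - 2.0) + rng.gauss(0.0, 1.0)


estimate = robbins_monro(noisy_f, x0=0.0)
print(f"estimated zero: {estimate:.3f}")
```

For this linear example the iterate after n steps equals 2 minus the average of the first n noise samples, so the noise averages out as the step sizes shrink.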

lnn, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
North America > Costa Rica > Heredia Province > Heredia (0.04)
Asia > India > Karnataka > Bengaluru (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

955fd82131e15e7b5199cbc8f983306a-Paper.pdf

Neural Information Processing Systems, Feb-10-2026, 01:15:53 GMT

algorithm, approximation, lemma 4, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > Costa Rica > Heredia Province > Heredia (0.04)
Asia > India > Karnataka > Bengaluru (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL

Neural Information Processing Systems, Feb-10-2026, 01:14:54 GMT

Most existing results only focus on online settings, in which agents can interact with the environment during training.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)

Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL

Neural Information Processing Systems, Feb-10-2026, 01:14:35 GMT

Then for N ( log (de H/ )) sufficiently large, with probability 1 , we have (b ;!) = O

artificial intelligence, machine learning, reinforcement learning, (12 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.42)

34f1c2e7ab91b6fa481ad0286a08ad02-Paper-Conference.pdf

Neural Information Processing Systems, Feb-10-2026, 01:14:24 GMT

equilibria, sequence, theorem 3, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
(3 more...)

Industry:

Leisure & Entertainment > Games (0.67)
Education (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.67)

Online Learning for Uninformed Markov Games: Empirical Nash-Value Regret and Non-Stationarity Adaptation

Liu, Junyan, Luo, Haipeng, Zhang, Zihan, Ratliff, Lillian J.

arXiv.org Machine Learning, Feb-10-2026

We study online learning in two-player uninformed Markov games, where the opponent's actions and policies are unobserved. In this setting, Tian et al. (2021) show that achieving no-external-regret is impossible without incurring an exponential dependence on the episode length $H$. They then turn to the weaker notion of Nash-value regret and propose a V-learning algorithm with regret $O(K^{2/3})$ after $K$ episodes. However, their algorithm and guarantee do not adapt to the difficulty of the problem: even in the case where the opponent follows a fixed policy and thus $O(\sqrt{K})$ external regret is well known to be achievable, their result is still the worse rate $O(K^{2/3})$ on a weaker metric. In this work, we fully address both limitations. First, we introduce empirical Nash-value regret, a new regret notion that is strictly stronger than Nash-value regret and naturally reduces to external regret when the opponent follows a fixed policy. Moreover, under this new metric, we propose a parameter-free algorithm that achieves an $O(\min \{\sqrt{K} + (CK)^{1/3},\sqrt{LK}\})$ regret bound, where $C$ quantifies the variance of the opponent's policies and $L$ denotes the number of policy switches (both at most $O(K)$). Therefore, our results not only recover the two extremes -- $O(\sqrt{K})$ external regret when the opponent is fixed and $O(K^{2/3})$ Nash-value regret in the worst case -- but also smoothly interpolate between these extremes by automatically adapting to the opponent's non-stationarity. We achieve this by first providing a new analysis of the epoch-based V-learning algorithm by Mao et al. (2022), establishing an $O(\eta C + \sqrt{K/\eta})$ regret bound, where $\eta$ is the epoch incremental factor. Next, we show how to adaptively restart this algorithm with an appropriate $\eta$ in response to the potential non-stationarity of the opponent, eventually achieving our final results.
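The way the stated bound interpolates between the two extremes can be checked numerically. The sketch below only evaluates the expressions quoted in the abstract with all constants and logarithmic factors dropped; it does not implement the algorithm or its restart scheme. The function names, the convention that a fixed opponent corresponds to C = 0 and L = 1, and the particular balancing choice of eta are assumptions made for illustration.

```python
import math


def adaptive_regret_rate(K, C, L):
    """Evaluate min{sqrt(K) + (C*K)^(1/3), sqrt(L*K)}, ignoring constants."""
    variance_term = math.sqrt(K) + (C * K) ** (1.0 / 3.0)
    switch_term = math.sqrt(L * K)
    return min(variance_term, switch_term)


def epoch_bound(K, C, eta):
    """Evaluate eta*C + sqrt(K/eta), the epoch-based bound, ignoring constants."""
    return eta * C + math.sqrt(K / eta)


K = 10**6

# Fixed opponent (assumed here to mean C = 0, L = 1): the rate is sqrt(K).
fixed_rate = adaptive_regret_rate(K, C=0, L=1)

# Worst case C = L = K: the K^(2/3) term dominates sqrt(K).
worst_rate = adaptive_regret_rate(K, C=K, L=K)

# Assumed balancing choice eta = (K / C^2)^(1/3) makes both terms of the
# epoch-based bound equal to (C*K)^(1/3).
C = 1000
eta = (K / C**2) ** (1.0 / 3.0)
balanced = epoch_bound(K, C, eta)

print(fixed_rate, worst_rate, balanced)
```

With this choice of eta, both terms of the epoch-based bound are of order (CK)^{1/3}, which matches the variance-dependent term in the final bound.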

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Machine Learning

2602.07205

Country:

North America > United States > California (0.14)
Asia > Middle East > Jordan (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.48)

Industry:

Leisure & Entertainment > Games (0.67)
Education > Educational Setting > Online (0.60)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.60)

NeurIPS2021_emergent_group_communication (7).pdf

Neural Information Processing Systems, Feb-9-2026, 23:13:41 GMT

We generate 128,000 images as agents' observations using Python's matplotlib library [Hunter, 2007]. A variational autoencoder [Kingma and Welling, 2014] is used to encode the observations. The input is a flattened 30,720-dimensional vector (32 by 320 by 3). Both the encoder and the decoder have one hidden layer of size 1,024. The output (communication message) is a 10-dimensional vector. ReLU is used as the activation function.
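The layer sizes described above can be sketched as a small NumPy forward pass. This is only an illustration of the stated shapes (30,720 inputs, one 1,024-unit hidden layer on each side, a 10-dimensional message, ReLU activations). The weight initialization, the separate mean and log-variance heads, the linear output layer, and the use of the latent mean as the message are assumptions, and the usage lines run with reduced sizes to keep memory low.

```python
import numpy as np

INPUT_DIM = 32 * 320 * 3  # flattened image, 30,720 values


def init_vae(input_dim=INPUT_DIM, hidden_dim=1024, latent_dim=10, seed=0):
    """Create weights for a one-hidden-layer encoder and decoder."""
    rng = np.random.default_rng(seed)

    def dense(n_in, n_out):
        # Assumed initialization: Gaussian scaled by 1/sqrt(fan-in), zero bias.
        w = rng.normal(0.0, 1.0 / np.sqrt(n_in), size=(n_in, n_out))
        return w, np.zeros(n_out)

    return {
        "enc_hidden": dense(input_dim, hidden_dim),
        "enc_mean": dense(hidden_dim, latent_dim),
        "enc_logvar": dense(hidden_dim, latent_dim),
        "dec_hidden": dense(latent_dim, hidden_dim),
        "dec_out": dense(hidden_dim, input_dim),
    }


def relu(x):
    return np.maximum(x, 0.0)


def encode(params, x):
    """Map observations to the mean and log-variance of the latent message."""
    w, b = params["enc_hidden"]
    h = relu(x @ w + b)
    w_mu, b_mu = params["enc_mean"]
    w_lv, b_lv = params["enc_logvar"]
    return h @ w_mu + b_mu, h @ w_lv + b_lv


def decode(params, z):
    """Map a latent message back to a flattened observation."""
    w, b = params["dec_hidden"]
    h = relu(z @ w + b)
    w_out, b_out = params["dec_out"]
    return h @ w_out + b_out


def forward(params, x, rng):
    """Reparameterized pass: sample z, reconstruct x, return the message."""
    mean, logvar = encode(params, x)
    z = mean + np.exp(0.5 * logvar) * rng.standard_normal(mean.shape)
    return decode(params, z), mean


# Usage with reduced sizes (hypothetical) so the demo stays lightweight.
params = init_vae(input_dim=48, hidden_dim=16, latent_dim=10)
batch = np.random.default_rng(1).random((2, 48))
reconstruction, message = forward(params, batch, np.random.default_rng(2))
print(reconstruction.shape, message.shape)
```

Calling `init_vae()` with its defaults reproduces the sizes quoted in the text, at the cost of allocating two weight matrices of roughly 31 million entries each.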

agent, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.99)
