AITopics | Agents

Near-OptimalNo-RegretLearningDynamicsfor GeneralConvexGames

Neural Information Processing SystemsFeb-13-2026, 05:02:42 GMT

A recent line of work has established uncoupled learning dynamics such that, when employed by all players in a game, each player's regret after T repetitions grows polylogarithmically in T, an exponential improvement over the traditional guarantees within the no-regret framework. However, so far these results have only been limited to certain classes of games with structured strategy spaces--such as normal-form and extensive-form games. The question as to whether O(polylogT) regret bounds can be obtained for general convex and compact strategy sets--which occur in many fundamental models in economics and multiagent systems--while retaining efficient strategy updates is an importantquestion.

artificial intelligence, machine learning, normal-form game, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.55)

Add feedback

Learning to Play With Intrinsically-Motivated, Self-Aware Agents

Nick Haber, Damian Mrowca, Stephanie Wang, Li F. Fei-Fei, Daniel L. Yamins

Neural Information Processing SystemsFeb-13-2026, 05:02:27 GMT

Neural Information Processing Systems http://nips.cc/

agent, intrinsic motivation, learning, (13 more...)

Neural Information Processing Systems

Country:

Oceania > Australia (0.04)
North America > United States > California > Santa Clara County > Stanford (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
(2 more...)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
(3 more...)

Add feedback

Object-Oriented Dynamics Predictor

Guangxiang Zhu, Zhiao Huang, Chongjie Zhang

Neural Information Processing SystemsFeb-13-2026, 04:25:15 GMT

Neural Information Processing Systems http://nips.cc/

learning, neural information processing system, prediction, (10 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

ExplicablePolicySearch

Neural Information Processing SystemsFeb-13-2026, 03:28:26 GMT

Human teammates often form conscious andsubconscious expectations ofeach other during interaction. Teaming success is contingent on whether such expectations can be met. Similarly,for an intelligent agent tooperate beside ahuman, it must consider the human's expectation of its behavior. Disregarding such expectations can lead to the loss of trust and degraded team performance. A key challenge here is that the human's expectation may not align with the agent's optimal behavior,e.g., duetothehuman'spartial orinaccurate understanding of thetaskdomain.

agent, artificial intelligence, machine learning, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.14)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.55)

Add feedback

Importance Resamplingfor Off-policy Prediction

Neural Information Processing SystemsFeb-13-2026, 03:27:16 GMT

Thoughunbiased, IScanbehigh-variance. Alowervariancealternativeis Weighted IS (WIS). Figure 4: Learning Ratesensitivityplotsinthe Random Walk Markov Chain, withbuffersizen = 15000 andmini-batchsizek = 16.

artificial intelligence, machine learning, reinforcement learning, (10 more...)

Neural Information Processing Systems

Country: