AITopics | Agents

To this end, we use kernel-based regularity assumptions to capture and exploit the structure in the opponent's response. We propose a novel algorithm for the learner when playing against an adversarial sequence of opponents.

artificial intelligence, machine learning, opponent, (15 more...)

Neural Information Processing Systems

Country:

Europe (0.28)
North America (0.28)

Industry:

Transportation > Infrastructure & Services (0.68)
Leisure & Entertainment > Games (0.47)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.96)

Add feedback

No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium Andrea Celli

Neural Information Processing SystemsOct-9-2025, 14:26:54 GMT

Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium.

artificial intelligence, infoset, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Industry: Leisure & Entertainment > Games (0.47)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)

Add feedback

07a9d3fed4c5ea6b17e80258dee231fa-AuthorFeedback.pdf

Neural Information Processing SystemsOct-9-2025, 13:06:59 GMT

agent, artificial intelligence, intrinsic reward, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.51)

Add feedback

fce2d8a485746f76aac7b5650db2679d-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 12:38:22 GMT

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Puerto Rico > San Juan > San Juan (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Promising Solution (0.46)

Industry:

Leisure & Entertainment > Games (0.68)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

f3f2ff9579ba6deeb89caa2fe1f0b99c-Supplemental-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 11:47:05 GMT

artificial intelligence, cql, machine learning, (14 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning Jianzhun Shao, Y un Qu

Neural Information Processing SystemsOct-9-2025, 11:47:02 GMT

MARL in real scenarios is still challenging due to the same safety and efficiency concerns in single-agent setting, then it is worth conducting investigation for offline RL in multi-agent setting.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country: Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Filters

Collaborating Authors

Agents

f968fdc88852a4a3a27a81fe3f57bfc5-AuthorFeedback.pdf

73a427badebe0e32caa2e1fc7530b7f3-Supplemental.pdf

6dfe08eda761bd321f8a9b239f6f4ec3-Paper.pdf

66de6afdfb5fb3c21d0e3b5c3226bf00-Paper.pdf

Learning to Play Sequential Games versus Unknown Opponents

No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium Andrea Celli

07a9d3fed4c5ea6b17e80258dee231fa-AuthorFeedback.pdf

fce2d8a485746f76aac7b5650db2679d-Paper-Conference.pdf

f3f2ff9579ba6deeb89caa2fe1f0b99c-Supplemental-Conference.pdf

Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning Jianzhun Shao, Y un Qu